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s  INTRODUCTION 

This  report  is  one  of  the  products  of  a  rather  broad  survey  of  the 
literature  on  human  factors  in  computer  systems.  The  survey  was  conducted 
under  research  contract  N00014-76-C-0866,  funded  by  the  Engineering  Psychology 
Programs  Division  of  the  Office  of  Naval  Research.  The  principal  purpdse 
of  the  survey  was  to  assess  the  current  state  of  our  knowledge  in  this^area, 
in  order  to  determine  whether  that  knowledge  is  adequate  to  support  the 
development  of  a  human  factors  guide  to  the  design  of  interactive  computer 
systems.  Although  this  report  provides,  to  some  extent,  a  descriptive  and 
critical  outline  of  the  literature  surveyed,  its  principal  focus  is  on  the 
issue  of  design  guidelines. 

THE  PROBLEM 

Human  factors  in  computer  systems  is  by  nature  a  highly  interdisciplin¬ 
ary  field.  As  a  result,  the  relevant  literature  is  widely  scattered.  Those 
who  would  perform  effective  application  work  in  this  area  must  be  aware  of  a 
broad  range  of  literature,  including:  (a)  the  general  literature  on  human 
factors,  as  well  as  that  concerned  specifically  with  computer  systems,  displays, 
data  entry,  and  specific  application  areas;  (b)  a  significant  segment  of  the 
basic  psychological  literature,  including  especially  the  areas  of  perception, 
information  processing,  and  cognitive  psychology;  and  (c)  a  significant 
segment  of  the  computer  science  literature,  including  especially  the  areas 
of  batch  and  interactive  languages,  display  and  input  devices  and  techniques, 
and  specific  application  areas. 

If  the  maintenance  of  such  a  broad  familiarity  with  the  literature  is 
difficult  for  the  specialist  in  this  area,  it  is  all  but  impossible  for  the 
typical  interactive  system  designer.  The  designer  is  usually  a  computer 
scientist  or,  less  frequently,  a  specialist  in  the  application  area  to  which 
computers  are  being  applied.  Although  the  designer  may  recognize  that  many 
design  decisions  have  human  factors  overtones,  he  or  she  has  typically  had 
little  or  no  exposure  to  the  psychological  or  human  factors  research  literature. 
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In  some  well  established  research  areas,  such  as  keyboard  design  and 
certain  physical  properties  of  displays,  guidelines  exist  which  are  reasonably 
good  and  fairly  detailed.  Such  guidelines  may  be  quite  helpful  in  the  design 
of  a  console  or  other  interface  device  for  a  system,  or  even  in  the  selection 
of  an  appropriate  off-the-shelf  input/output  device.  As  we  progress  toward 
the  more  central  issues  in  interactive  systems,  such  as  their  basic  informational 
properties,  user  aids,  and  dialogue  methods,  available  guidelines  become 
sketchy  and  eventually  nonexistent.  The  interactive  system  designer  is  given 
little  human  factors  guidance  with  respect  to  the  most  basic  design  decisions. 

In  fact,  the  areas  in  which  existing  guidelines  concentrate  are  often  not 
even  under  the  control  of  the  designer,  who  may  have  more  freedom  with  respect 
to  dialogue  and  problem-solving  aids  than  with  respect  to  terminal  design  or 
selection. 

Between  these  two  extremes  lies  a  middle  ground  in  which  a  few  attempts 
have  been  made  to  develop  guidelines.  Engel  and  Granda  (1975)  have  produced 
an  excellent,  but  very  brief,  set  of  guidelines  which  addresses  some  of  the 
dialogue  issues  and  touches  on  display  formatting.  The  International  Purdue 
Workshop  on  Industrial  Computer  Systems  (1975)  has  produced  a  document  intended 
for  use  by  designers  of  process  control  systems.  This  document  is  oriented 
around  the  design  process,  and  has  several  desirable  features,  but  it  is  un¬ 
fortunately  weak  in  the  human  factors  area.  Several  other  authors  have  written 
less  formal  discussions  which  deal  with  desirable  dialogue  properties  and 
related  matters.  No  guidelines  have  been  found,  however,  which  are  both 
extensive  and  sound. 

The  dearth  of  human  factors  guidelines  with  respect  to  interactive 
problem-solving  methods,  user  aids,  etc.,  comes  as  no  surprise.  The  con¬ 
centration  of  existing  guidelines  on  person-computer  interface  hardware  and 
low-level  dialogue  issues  clearly  reflects  the  historical  emphasis  of  the 
field.  Only  fairly  recently  has  a  really  strong  emphasis  on  cognitive 
issues  become  apparent  in  the  practice  of  human  factors.  The  shift  in 
emphasis  results  in  part  from  the  increasing  use  of  interactive  systems  to 


support  cognitive  tasks,  and  in  part  from  the  emergence  of  cognitive  psychology 
and  human  information  processing  as  stronger  areas  of  basic  behavioral  theory 
and  experimentation.  Undoubtedly,  it  also  reflects  the  increasing  maturity 
of  human  factors  as  a  professional  discipline. 

More  extensive  human  factors  guidance  for  interactive  system  design  is 
sorely  needed.  The  computer  science  community  is  becoming  increasingly  aware 
of  the  significance  of  human  factors  issues  in  interactive  systems,  but  the 
available  options  are  limited.  Newman  (1977)  is  undoubtedly  correct  in 
asserting  that  psychologists  and  system  designers  complement  one  another,  and 
that  the  most  effective  approach  is  a  direct  involvement  of  human  factors 
personnel  in  the  design  effort.  Yet  very  few  psychologists  are  sufficiently 
well-grounded  in  computer  science  to  achieve  acceptance  in  this  role,  and 
the  number  of  new  interactive  systems  being  developed  is  staggering.  Pew 
and  Rollins  (1976)  have  tried  an  approach  in  which  human  factors  personnel 
define  a  dialogue-specification  technique,  building  appropriate  human  factors 
constraints  into  a  specification  method  which  is  then  used  by  the  system 
designers.  This  approach  may  allow  a  limited  number  of  human  factors  personnel 
to  exert  a  larger  design  influence  than  is  possible  by  direct  involvement  in 
the  design.  It  is  not  clear,  though,  that  this  approach  is  applicable  in  really 
complex  systems,  or  cost-effective  in  small  systems.  In  any  case,  this  approach 
concentrates  on  the  form  of  the  dialogue,  and  does  not  address  more  basic  issues 
of  user  information  requirements,  selection  and  design  of  appropriate  problem¬ 
solving  aids,  etc. 

The  development  of  human  factors  guidelines  for  use  by  system  designers 
appears  to  be  the  only  viable  solution  of  this  problem.  As  has  already  been 
noted,  though,  existing  guidelines  are  somewhat  primitive.  The  principal  goal 
of  this  study  was  to  determine  whether  or  not  the  state  of  the  art  in  human 
factors  in  computer  systems  is  currently  adequate  to  support  the  development 
of  more  extensive  guidelines.  A  secondary  goal  was  to  determine  what  form 
such  guidelines  should  take,  if  indeed  their  development  is  feasible. 


THE  LITERATURE  SURVEY 


In  order  to  investigate  these  questions,  an  extensive  literature  survey 
was  undertaken.  A  companion  report  (Ramsey,  Atwood,  and  Kirshbaum,  1978) 
describes  the  survey  in  detail,  including  the  areas  covered  and  those  excluded, 
as  well  as  the  survey  method  employed.  That  report,  which  is  a  bibliography 
of  the  most  relevant  literature,  contains  descriptions  and  critical  commentaries 
on  564  papers. 

Briefly,  the  literature  surveyed  was  concerned  with  such  areas  as 
interactive  input  and  output  devices  and  techniques  (and,  to  a  limited  extent, 
batch  input/output  devices  and  techniques,  as  well);  people-computer  dialogue 
(including  languages  and  dialogue  methods);  analysis  of  user  and  task 
properties;  techniques  for  user  requirements  analysis,  system  design,  and 
people-computer  system  modeling  and  evaluation;  and  people-computer  problem 
solving.  In  an  effort  to  keep  the  survey  manaqeable,  some  areas  were  generally 
excluded  from  consideration  even  though  they  may  intersect  the  broad  area  of 
human  factors  in  computer  systems.  These  topics  include  personnel  selection, 
training,  and  management;  documentation;  social  and  organi zational  implications 
of  computers;  software  development;  and  literature  describing  specific  systems. 

An  important  aspect  of  the  survey  was  its  attempt  to  include  relevant 
articles  not  only  from  the  human  factors  literature,  but  also  from  the  basic 
literature  of  both  behavioral  science  and  computer  science.  If  the  ultimate 
goal  of  this  effort  is  the  development  of  guidelines  useable  by  interactive 
system  designers,  it  is  clear  that  one  must  consider  not  only  knowledge 
gained  through  behavioral  experimentation,  but  also  current  and  projected 
practices  in  people-computer  interaction. 

The  literature  survey  was  quite  extensive.  Some  20,000  citations  were 
considered,  of  which  more  than  4,000  were  found  to  be  at  least  loosely  relevant. 
The  original  intent  of  the  survey  was  to  be  quite  exhaustive. 


However,  resource  limitations,  as  well  as  difficulties  in  obtaining  copies 
of  a  few  papers,  have  precluded  detailed  consideration  of  some  papers  which 
are  probably  directly  relevant.  When  resource  limitations  have  been  an  issue, 
breadth  of  coverage  has  been  the  primary  criterion  for  selection  of  the  564 
papers  actually  considered  in  detail.  For  the  purposes  of  this  study, 
acquisition  of  a  sound  overview  of  the  literature  was  considered  more 
important  than  a  comprehensive  treatment  of  any  particular  specialty  area 
within  the  field. 

GUIDELINES  AND  THE  DESIGN  PROCESS 

When  the  phrase  "interactive  system  design"  is  used  in  this  paper, 
the  reference  is  to  the  process  whereby  the  functional  behavior  of  an 
interactive  system  is  determined.  The  decisions  made  in  this  design  process 
involve  questions  of  basic  system  informational  properties  from  the  user's 
viewpoint,  and  of  interactive  dialogue,  display  formatting,  etc.  The  design 
activity  with  which  this  paper  is  concerned  is  often  called  "requirements 
analysis"  or  "functional  system  design."  This  study  is  not  concerned  with 
the  later  "software  design"  process  in  which  the  design  decisions  involve 
software  modules,  files,  etc. 

The  emphasis,  among  developers  of  human  factors  guidelines,  has  been 
placed  heavily  on  quantitative  reference  handbooks.  That  emphasis  is  consis¬ 
tent  with  a  desire  for  high  qua  itative  precision  and  with  the  expressed 
preferences  of  designers  --  though  not  necessarily  interactive  computer  system 
designers  --  for  concise,  preferably  graphical,  reference  data  (e.g.,  Meister 
and  Farr,  1967).  For  such  handbooks  to  be  satisfactory,  however,  at  least  two 
conditions  must  be  satisfied. 

First,  the  handbook  must  be  based  on  an  extensive  collection  of  raw 
quantitative  data  pertaining  to  operator  performance  and  operator-system 
interaction.  This  requirement  has  led  to  widespread  advocacy  of  large-scale 
parametric  experimentation  (e.g.,  Granda,  1977;  Meister,  1977)  for  the  purpose 


of  constructing  such  a  data  base.  Such  an  undertaking  is  expensive,  but 
sorely  needed,  in  the  interactive  computer  systems  area,  as  well  as  in  many 
other  person-system  domains.  One  clear  conclusion  of  this  survey  effort  is 
that  quantitative  data  on  user  performance  and  user-system  interaction  are 
few  and  fragmented.  A  quantitative  human  factors  reference  handbook  for 
interactive  systems  design  appears  to  be  well  beyond  our  current  capability. 

There  is  a  second  assumption  underlying  the  desire  for  quantitative 
reference  handbooks,  however.  The  design  decisions  for  which  they  are  used 
must  be  explicitly  identified  by  the  designer.  In  "classical"  system  design 
situations,  this  may  be  a  reasonable  expectation.  When  a  design  engineer 
encounters  issues  such  as  level  of  illumination,  size  of  control  knobs,  or 
type  and  position  of  a  numerical  display,  the  design  decision  involved  tends 
to  be:  (1)  quantitative,  (2)  explicit,  and  (3)  clearly  associated  with  the 
"operator  interface",  and  therefore  closely  identifiable  with  human  factors. 
Even  here,  there  is  evidence  which  suggests  that  designers  may  proceed  with¬ 
out  consulting  readily  available  human  factors  handbooks  (Meister  and  Farr, 
1967). 


In  the  design  of  interactive  computer  systems,  virtually  every 
decision  which  affects  the  functional  behavior  of  the  system  has  direct 
human  factors  overtones.  This  claim  can  be  made  in  the  cases  of  automobiles, 
aircraft,  and  radios,  too,  but  only  in  a  much  weaker  sense.  In  a  system 
whose  basic  function  is  communication  with  a  user,  and  whose  basic  purpose 
often  is  to  assist  the  user  with  tasks  which  are  cognitive  or  informational 
in  nature,  human  factors  issues  pervade  the  entire  design  process. 

Furthermore,  the  important  design  decisions  may  not  be  explicitly 
recognized.  It  appears  that  the  primary  problem-solving  method  employed  in 
design  tasks  is  an  "analytic/synthetic"  approach.  This  approach  involves 
an  analysis  of  the  design  problem,  in  which  the  components  of  the  problem 
are  identified,  followed  by  the  synthesis  of  a  solution.  It  does  not 
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involve  an  explicit  search  for  alternative  solutions;  instead,  a  solution 
is  "synthesized"  based  on  pattern  recognition  and  the  use  of  components 
from  past  solutions  to  other  problems.  While  this  characterization  of 
designer  behavior  is  somewhat  speculative,  it  appears  quite  compatible  with 
a  variety  of  observations  of  an  empirical  (e.g.,  Meister  and  Farr,  1967), 
theoretical  (e.g.,  Simon,  1969),  or  pragmatic  (e.g.,  Granda,  1976)  nature. 

If,  in  interactive  systems,  the  design  decisions  which  involve  human 
factors  issues  are  pervasive,  implicit,  and  often  qualitative,  a  reference 
handbook  is  unlikely  to  be  effective.  However,  there  is  another  mechanism 
for  providing  human  factors  guidance  to  the  system  designer,  which  appears 
to  be  more  compatible  with  the  design  process  and  the  designer's  information 
needs.  It  also  appears  to  be  more  compatible  with  the  current  state  of  our 
knowledge.  This  approach  Is  the  design  guide. 

As  the  term  will  be  used  here,  a  "design  guide"  is  a  document  whose 
structure  parallels  the  major  steps  of  the  design  process  itself.  Although 
such  a  guide  should  be  indexed  sufficiently  that  it  can  be  consulted  with 
a  specific  question  after  that  question  has  been  identified,  that  is  not 
the  design  guide's  primary  mode  of  use.  It  is  intended  to  be  consulted 
during  the  analytic  process,  before  design  decisions  are  made.  Such  a  guide 
must  identify  issues,  suggest  alternatives,  and  present  (where  they  exist) 
hard  human-factors  data  at  the  point  in  the  design  process  at  which  this 
information  is  most  relevant.  To  make  the  concept  somewhat  more  concrete. 

Figure  1  suggests  some  possible  major  sections  which  might  be  contained  in 
such  a  document'. 

Clearly,  this  is  more  of  a  "how-to-do-it"  document  than  has  typically 
been  undertaken  by  human  factors  personnel.  Such  an  approach  appears  justified, 
however,  by  several  factors:  (1)  the  pervasiveness  of  human  factors  issues 
throughout  the  interactive  system  design  process;  (2)  the  likelihood  that  a 
reference  handbook  would  not  be  consulted  and  used  effectively;  (3)  greater 
compatibility  with  the  problem-solving  behavior  and  information  needs  of  the 
system  designer;  (4)  inadequacy  of  the  existing  human  factors  data  base  to 
support  development  of  a  quantitative  reference  handbook;  (5)  comprehensiveness. 
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Users:  their  behavior  in  general;  how  to  determine  the  properties  of  a 
particular  user  population;  the  implications  of  those  properties 
for  the  interactive  system. 

Tasks:  what  tasks  users  perform;  how  to  determine  tasks  involved  in  an 
appl ication. 

Requirements  analysis:  How  to  analyze  information  requirements;  how  to 
select  appropriate  types  of  problem-solving,  clerical  and  user 
support  aids;  allocation  of  basic  tasks  to  user  or  computer; 
modeling  of  user-system  interactions;  evaluation  of  basic  design. 

Interactive  dialogue:  properties  of  different  dialogue  types;  selection 
of  appropriate  dialogue  type(s);  detailed  design  of  corrmand 
language,  system  access  structures,  tutorial  aids,  etc. 

Output  devices  and  techniques:  properties  of  display  devices;  implications 

of  dialogue  method  for  display  device  selection;  selection  or  design 
of  display  device(s);  detailed  display  design,  formatting,  coding 
techniques,  etc. 

Input  devices  and  techniques:  properties  of  input  devices;  implications  of 
dialogue  methods  for  input  device  selection;  selection  or  design 
of  input  device(s) . 

Evaluation  of  system  performance:  use  of  subjective  evaluations,  objective 
performance  measures. 


Figure  1.  Possible  Section  Topics  for  a  Design  Guide 
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in  the  sense  that  this  format  allows  discussion  end  some  guidance  even  in 
areas  in  which  specific  experimental  data  are  lacking;  and  (6)  the  fact  that 
this  general  format  is  capable  of  encompassing  not  only  human  factors  data, 
but  also  applied  human  factors  methods. 

Ordinarily,  conclusions  are  presented  in  the  "Conclusions"  section  of 
a  report.  In  this  case,  however,  it  seems  necessary  to  forewarn  the  reader. 
The  following  sections  discuss  the  surveyed  literature  from  the  point  of  view 
of  its  applicability  to  design  guidelines.  The  literature  must  be  considered 
with  both  types  of  human  factors  guidance  in  mind.  Either  type  might  have 
been  supportable  by  the  literature,  or  both,  or  neither. 


USER  AND  TASK  PROPERTIES 


USER  NEEDS,  CAPABILITIES,  AND  LIMITATIONS 

As  Meister  (1976)  points  out  quite  lucidly,  an  interactive 
system  acquires  its  characteristics  from  the  goals  adopted  by  its 
designer.  If  the  system  is  to  be  effective,  the  system  goals  must 
be  compatible  with  the  needs  and  capabilities  of  the  individual  user. 

While  this  compatibility  may  be  achievable,  at  least  in  part,  through 
operator  selection  and/or  training,  it  is  its  achievement  through 
appropriate  system  design  that  is  of  concern  here. 

If  compatibility  between  the  system's  purpose  and  the  user's 
needs  and  capabilities  is  critical  to  system  success,  the  designer 
must  somehow  be  made  aware  of  those  needs  and  capabilities.  This 
does  not  appear  to  have  been  accomplished  in  any  effective  or  system¬ 
atic  way.  The  literature  most  commonly  used  by  interactive  system 
designers  contains  many  such  prescriptions  as,  "Make  the  system  easy 
to  use,"  but  contains  few  explicit  treatments  of  user  capabilities, 
limitations,  or  habits,  either  in  general  or  by  specific  user  population. 
The  systems  design  literature  does  contain  discussions  of  user  require¬ 
ments  analysis  methods  (see  next  section),  especially  interview  and 
simulation  methods,  but  the  use  of  these  techniques  does  not  obviate 
a  need  for  basic  data  concerning  user  behavior. 

Clearly,  the  behavioral  issues  involved  here  are  complex,  subtle, 
and,  in  many  respects,  poorly  understood.  At  the  most  basic  level  (from 
the  viewpoint  of  the  designer),  user  acceptance,  user  confidence,  and 
good  user-system  performance  are  all  desirable  or  even  necessary,  but 
may,  under  some  circumstances,  be  conflicting  goals.  For  example, 

Dickson  et  al  (1977)  compared  raw-data  displays  with  statistically  sum¬ 
marized  displays  in  a  management  information  system.  Although  better 
decisions  resulted  with  summary  displays,  the  decisions  took  longer, 
and  user  confidence  in  the  decisions  was  lower.  Artifical  lockout, 
in  which  the  user  is  required  to  wait,  (several  minutes)  between  requests 
for  computation,  has  been  found  to  improve  performance  in  a  problem-solving 
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task  (Boehm  et  al,  1971;  Seven  et  al,  1971),  but  was  also  found  to  cause 
considerable  user  dissatisfaction.  Franklin  and  Dean  (1974)  describe 
user  reactions  to  a  graphical  system  for  electronic  circuit  design.  The 
intended  users  (students)  reportedly  found  the  system  easy  to  use,  but 
avoided  using  it  because  they  were  intimidated  by  its  rapid  response  and 
the  belief  that  they  were  tying  up  expensive  computational  resources. 

In  the  face  of  extremely  complex  behavioral  issues,  ready-made 
prescriptions  often  cause  more  difficulties  than  they  correct.  It  would 
appear  to  be  necessary  to  convey  to  the  designer  a  better  understanding 
of  the  behavioral  issues  themselves,  and  of  the  variables  which  most  directly 
influence  user  behavior.  Carried  to  an  extreme,  this  implies  a  curriculum 
in  psychology.  There  does  appear  to  be  a  middle  ground,  however,  which  is 
tenable  and  potentially  effective. 

Strengths  and  Weaknesses  of  Humans  and  Computers 

A  simple  (and  often  simplistic)  approach  to  this  problem  is  the 
listing  of  strengths  and  weaknesses  of  both  humans  and  computers.  Such 
lists  are  common,  and  often  contain  such  prescriptions  as  "humans  are 
good  at  decision  making;  computers  are  good  at  computation."  Aside  from 
being  so  vague  as  to  contain  little  useful  information,  these  comparisons 
may  even  be  misleading  (see  Vaughn  &  Mavor,  1972). 

A  related,  but  much  more  satisfactory  approach  is  represented  by 
Miller  (1967).  Miller  describes  the  problem-solving  and  information¬ 
processing  capabilities  of  the  human  user  of  a  computer  system  in  a  very 
readable  way.  The  characterization  is  reasonably  good,  but  is  subject 
to  the  criticism  that  it  is  not  sufficiently  specific  to  be  of  clear 
relevance  to  the  designer.  The  only  obvious  way  of  making  such  discus¬ 
sions  more  specific  is  to  limit  the  discussion  to  specific  tasks  which 
might  be  performed  by  the  user.  Crawford  et  al  (1977)  take  a  worthwhile 
step  in  this  direction.  Real  progress  in  this  direction  will  require  a 
more  effective  taxonomy  of  operator  tasks  than  presently  exists.  As  a 
later  section  will  show,  this  is  currently  a  very  weak  area,  in  spite  of 
several  attempts  to  develop  such  taxonomies.  In  the  foreseeable  future, 
the  best  approach  may  be  a  description  of  operator  capabilities  and 
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limitations  with  respect  to  a  subset  of  fairly  explicit  tasks  associated 
with  a  particular  class  of  interactive  system  (e.g.,  message  handling, 
bibliographic  search). 

Behavior  of  Specific  User  Classes 

Investigations  of  the  behavior  and  needs  of  specific  user  classes 
are  rather  primitive,  at  present,  but  nonetheless  have  produced  results 
which  are  of  significance  to  the  designer.  The  most  informative  investi¬ 
gations  of  this  sort  have  been  the  reports  of  the  "MICA  survey"  (Eason, 

1976;  Eason  et  al,  1975;  Stewart,  1974,  1976b).  Three  major  user  classes 
have  been  subject  to  considerable  discussion  in  the  literature:  (1)  "naive" 
(or  computer- inexperienced)  users,  (2)  managers,  and  (3)  scientific  and 
technical  users.  Brief  discussions  of  some  major  behavioral  properties  of 
these  user  classes  are  shown  in  Figure  2.*  In  addition  to  these  user 
classes,  there  are  scattered  discussions  in  the  literature  of  consumers 
(viewed  as  indirect  system  users;  Ivergard,  1976),  non-English-speaking 
users  (Hanes,  1975),  handicapped  users  (Raitzer  et  al,  1976),  and  even 
illiterate  users  (Evans,  1976). 


Naive  Users 

The  naive  user  has  been  subject  to  considerable  discussion  in  the 
recent  literature.  The  interactive  system  designer,  who  is  necessarily 
rather  sophistica*ed  with  respect  to  computers,  often  has  great  diffi¬ 
culty  producing  a  Dasic  system  design,  and  particularly  a  detailed  dia¬ 
logue  design,  which  is  comprehensible  and  easily  useable  by  the  rela¬ 
tively  naive  user.  Aside  from  a  number  of  special  abilities  which  pro¬ 
grammers  probably  have  as  a  result  of  self-selection,  they  also  have  a 
great  deal  of  specialized  knowledge  of  system  behavior  which  affects  the 
way  in  which  they  interact  with  systems.  For  example,  an  understanding 
of  the  interfaces  among  operating  system  and  application  modules  often 


*  In  this  and  subsequent  tables,  a  single  asterisk  (*)  indicates  a  refer¬ 
ence  which  presents  survey  or  questionnaire  data  or  summarizes  experi¬ 
mental  results  concerning  the  topic  of  the  table.  A  double  asterisk 
(**)  indicates  a  reference  which  presents  user  performance  data  or  rela¬ 
tively  detailed  results  of  controlled  experimental  study(s)  on  the  topic 
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allows  sophisticated  users  to  correctly  interpret  otherwise  ambiguous 
error  messages  on  the  basis  of  context  cues  not  useable  by  the  naive 
user. 

While  the  design  community  is  aware  of  this  problem  in  general, 
discussions  of  appropriate  design  properties  are  often  quite  vague 
("make  the  system  easy  to  use").  With  appropriate  caveats,  much  more 
specific  guidance  is  possible.  For  example,  the  use  of  computer- 
initiated  dialogue  (in  which  the  computer  presents  alternatives,  ques¬ 
tions,  or  forms,  and  the  user  responds)  is  almost  always  preferable  for 
naive  users,  unless  fairly  sophisticated  knowledge-based  or  natural- 
language  techniques  are  used  (see  Thompson,  1969,  1971;  and  Martin,  1573 
for  discussion).  Such  practices  as  unnecessary  abbreviation,  and  command 
languages  with  positional  parameters,  probably  have  more  significant 
negative  effects  with  naive  users  than  with  experienced  users.  Engel 
and  Granda  (1975)  and  Nickerson  and  Pew  (1971)  offer  a  number  of  guide¬ 
lines  with  respect  to  command  language  design,  user  feedback,  error 
message  contents,  system  behavior  in  the  event  of  errors,  etc.,  which 
are  especially  applicable  to  naive  users.  Appropriate  use  of  embedded 
training  (see  later  section)  and  other  tutorial  features  distributed 
throughout  the  dialogue  can  be  of  assistance  with  naive  users. 

If  the  naive  user  is  a  one-time  user,  or  occasional  user,  of 
the  system,  the  use  of  a  simple,  clear,  computer-initiated  dialogue 
may  be  sufficient  to  ensure  satisfactory  results  (see,  for  example, 

Evans,  1976).  For  the  repeating  user,  however,  dialogue  simplicity 
may  be  a  two-edged  sword.  In  one  of  the  few  empirical  studies  of  naive 
users,  Eason  et  al  (1975)  report  that  overemphasis  on  dialogue  simpli¬ 
city  often  prevents  the  user  from  utilizing  the  full  computational 
power  of  the  system.  As  the  user  acquires  experience  with  the  system, 
the  constraints  imposed  by  the  dialogue  become  burdensome.  A  frequent 
practice  in  modern  systems  is  the  provision  of  two  dialogue  modes: 

(1)  a  simple,  highly  tutorial,  computer-initiated  mode  for  the  beginning 
or  naive  user,  and  (2)  a  highly  abbreviated,  user-initiated  mode  for  the 
experienced  or  computer-sophisticated  user.  Only  rarely  is  any  mechanism 
provided  for  achieving  a  smooth  transition  between  these  two  modes  as 
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the  user  acquires  more  experience.  The  findings  of  Eason  et  al  would 
suggest  that  this  may  be  a  significant  failing. 

Sophisticated  users,  including  system  designers,  are  often  unaware 
of  the  specialized  knowledge,  context  cues,  etc.,  which  they  use  in 
interacting  with  computer  systems.  It  is  conceivable  that  the  designer 
might  benefit  from  an  explicit  discussion  of  the  differences  between 
naive  and  sophisticated  users.  Ideally,  such  a  discussion  should  be 
based  on  empirical  analyses  of  information  utilization  by  both  classes 
of  user.  No  such  analyses,  and  no  such  comparative  discussion,  were 
found  in  the  survey. 

Managers 

The  term  "manager"  is  used  broadly  here,  and  includes  not  only 
business  executives,  but  also  military  commanders  and  others  in  an 
organizational  management  role.  Managers  are  usually  naive  users,  in 
the  sense  discussed  above,  but  possess  enough  additional  common  character¬ 
istics  to  warrant  consideration  as  a  separate  class.  In  fact,  the 
specialized  needs  of  these  users  has  spawned  a  large  literature  describing 
"lessons  learned”  and  systems  analysis  techniques  for  management  informa¬ 
tion  systems  (MIS's).  Once  again,  though,  little  has  been  done  in  the  way 
of  formal  empirical  study. 

In  interacting  with  a  computer  system,  the  user  incurs  certain  costs, 
not  only  in  dollars,  but  also  in  terms  of  personal  time  and  effort.  In 
return,  the  user  receives  certain  benefits,  which  may  include  improved 
solutions,  savings  of  time  and  effort  relative  to  other  solution  techni¬ 
ques,  etc.  Computer  users  in  general  appear  to  behave  as  if  they  perform 
implicit  cost-benefit  evaluations  based  on  their  own  perceptions  of  costs 
and  utilities  (see  Carbonell,  1967,  for  discussion).  Managers  are  no 
exception,  but  they  appear  to  place  a  particularly  high  negative  value 
on  personal  time  and  effort,  and  when  compared  to  other  user  classes, 
have  more  "discretion",  or  authority  to  reject  the  system  or  use  it 
differently  than  intended  (Eason,  1976).  These  two  factors  make  it 
particularly  difficult  to  "capture"  the  manager  as  a  direct  user  of  an 
interactive  system. 
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There  is  also  some  evidence  that  the  information  requirements 
of  managers  are  more  variable  than  those  of  most  user  classes.  Eason 
(1976)  has  noted  this  problem  in  connection  with  business  MIS's,  and 
Raichelson  (1972)  reports  a  similar  problem  with  command  and  control 
systems.  Both  suggest  that  greater  system  flexibility  is  required, 
but  differ  in  the  means  suggested  for  achieving  it.  It  is  especially 
interesting  to  note  that  past  MIS  failures  have  led  to  widespread 
advocacy  of  more  --  and  more  formal  —  system  analysis  techniques  aimed 
at  extracting  specific  information  requirements  from  managers  prior  to 
MIS  design.  While  these  methods  may  be  very  helpful  in  providing  the 
designer  with  a  general  understanding  of  the  environment,  Eason's  survey 
results  and  Raichelson' s  observations  would  suggest  that  this  approach 
cannot,  in  principle,  solve  the  basic  problem.  A  later  section  on  system 
flexibility  will  address  this  issue  further. 

Scientific  and  Technical  Users 

Scientists,  engineers,  etc.,  constitute  a  third  user  class  which 
has  received  some  specialized  attention.  Allen  (1969)  and  Back  (1972) 
discuss  the  information  requirements  of  such  users  from  the  point  of  view 
of  general  scientific  communication,  but  do  not  relate  those  needs  to 
the  use  of  interactive  tools.  On  the  basis  of  the  MICA  survey,  Stewart 
(1974)  indicates  that  most  specialist  users  report  difficulty  using 
on-line  systems  due  to  inadequate  workstations,  terminal  operation 
problems,  or  difficulties  with  software  and  operating  procedures.  In 
another  paper,  Stewart  (1976)  characterizes  these  users  as  utilizing 
computerized  tools  in  three  broad  ways:  (1)  use  of  predefined  analyti¬ 
cal  packages,  (2)  definition  of  specialized  programs  to  be  implemented 
by  others,  and  (3)  hands-on  programming.  In  fact,  it  may  be  the  response 
of  these  users  to  inappropriate  or  inadequate  automated  tools  which  most 
strongly  differentiates  them  from  other  classes. 

User  Responses  to  Inadequate  Systems 

Another  result  of  the  study  of  specialized  user  classes  has  been 


16 


the  development  of  a  gross  characterization  of  user  responses  to 
inappropriate  or  inadequate  systems.  A  list  of  common  responses, 
due  primarily  to  Eason  (1976)  and  Stewart  (1976)  is  presented  in 
Figure  3.  As  indicated  in  the  figure,  user  responses  vary  with  the 
role,  status,  and  knowledge  of  the  user.  For  example,  a  very  common 
response  of  managers  to  systems  which  fail  to  totally  satisfy  their 
needs,  or  are  simply  difficult  for  the  managers  to  operate,  is  the 
interposition  of  another  individual  between  the  manager  and  the 
system.  That  individual  is  responsible  for  direct  operation  of  the 
system  and  satisfaction  of  the  manager's  information  needs.  Eason 
contends  that  this  is  an  unsatisfactory  response,  as  it  increases 
the  effective  response  time  of  the  system,  and  the  role  of  inter¬ 
mediary  is  very  difficult  to  fill.  Alter  (1977),  on  the  other  hand, 
believes  that  the  use  of  an  expert  intermediary  may  be  a  desirable 
approach.  In  any  event,  the  designer  must  be  aware  of  the  possibility 
of  such  "distant  use".  If  the  system  is  designed  for  the  manager  as 
direct  user,  but  is  in  fact  operated  by  an  expert  intermediary,  the 
dialogue  method,  and  perhaps  other  system  properties,  are  almost 
certain  to  be  inappropriate. 

Scientific  and  engineering  users  tend  to  respond  to  inadequate 
tools  in  an  entirely  different  way,  becoming  progressively  more  in¬ 
volved  in  the  design,  and  even  programming,  of  the  systems  they  use. 
Having  observed  computer-aided  problem-solving  practices  in  this  user 
community,  Stewart  (1976)  suggests  that  this  is  an  undesirable  situation. 
Obviously,  this  can  be  a  costly  practice,  but  Stewart's  concern  is  more 
subtle.  He  suggests  that  those  who  become  heavily  involved  in  tool 
development  may  apply  their  tools  to  new  problems  whether  they  are 
appropriate  or  not.  The  resulting  psychological  set  may  interfere 
with  problem  solving,  and  may  even  influence  problem  selection. 

Statistics  on  Use  of  Time-Sharing  Systems 

There  exist  a  large  number  of  statistical  studies  of  user  be¬ 
havior  when  interacting  with  specific  interactive  systems.  Typically, 
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Figure  3.  User  Responses  to  Inadequate  System 


these  data  have  been  gathered  automatically  by  monitors  built  into  a 
time-sharing  operating  system.  Figure  4  describes  a  representative 
sampling  of  these  studies.  In  general,  the  data  reported  consist  of 
descriptive  statistics  concerning  the  temporal  properties  of  user 
sessions,  and  frequency  distributions  for  usage  of  particular  system 

processors  or  commands.  There  is  good  reason  to  believe  that  these 
data  are  not  very  general izable  from  one  system  to  another,  because 
of  very  large  variations  in  system  and  user  population  properties. 

These  statistical  summaries  are  ordinarily  obtained  for  use  in  re¬ 
sponse  time  and  system  utilization  planning  studies,  and  they  are 
typically  published  as  an  internal  report  by  the  institution  involved. 

It  is  conceivable  that  a  serious  attempt  to  collect  and  integrate  these 
reports  would  yield  useful  information.  With  a  few  exceptions  (e.g., 
Boies  &  Spiegel,  1973;  Carlisle,  1972),  the  discussions  are  purely 
descriptive,  and  provide  little  real  behavioral  insight. 

User  Errors  and  Error  Analysis 

Errors  constitute  an  aspect  of  behavior  which  is  of  particular 
importance  to  the  designer.  Very  large  reductions  in  error  frequency 
and  severity  can  be  achieved  if  the  system  design  is  based  on  an  ade¬ 
quate  understanding  of  the  types  and  causes  of  errors  which  occur  in 
the  use  of  the  system,  or  which  occur  elsewhere  in  the  process,  of  which 
the  system  is  a  part.  A  poorly  designed  system,  on  the  other  hand,  may 
not  only  fail  to  provide  assistance  to  the  user  in  particularly  error- 
prone  situations,  but  may  actively  induce  user  errors. 

The  standard  measures  used  in  human  performance  studies  are  speed 
and  accuracy,  and  almost  all  such  studies  contain  information  on  errors. 
These  data  are  often  highly  task-specific,  however,  and  the  integration 
of  such  information  is  no  easy  task.  Most  of  the  explicit  discussions 
of  user  errors  found  in  the  survey  deal  with  data  entry.  Rela¬ 
tively  few  discussions  were  found  of  errors  in  computer-aided  problem 
solving,  or  even  errors  related  to  interactive  dialogue,  and  no  inte¬ 
grative  summary  was  found  in  these  areas. 
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Figure  4.  Representative  Examples  of  Statistical  Studies 
of  User  Behavior  in  Time-Sharing  Systems 
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A  few  scattered  papers  deal  with  error-prone  properties  of 
query  languages  and  query  problems.  For  example,  some  constructs 
which  appear  to  be  error-prone  are  disjunction  and  negation  (L.  A. 
Miller,  1974),  re-expression  of  inequalities  (Gould  &  Ascher,  1975), 
and  logical  set  relations  (Thomas,  1976).  A  larger  --  though  not 
necessarily  more  useful  --  literature  exists  in  the  area  of  computer 
programming  errors  and  error-prone  programming- language  properties, 
but  that  literature  was  not  within  the  domain  of  the  survey. 

The  fact  that  data  entry  dominates  the  discussion  of  user 
errors  is  not  necessarily  discouraging.  Data  entry  is  a  major 
source  of  costly  errors,  and  this  literature  contains  several  good 
models  for  the  presentation  of  error  data.  There  appear  to  be  at 
least  four  forms  in  which  information  about  user  errors  might  use¬ 
fully  be  conveyed  to  the  designer.  The  first  is  a  simple  tabula¬ 
tion  of  error  rates  for  particular  tasks.  With  few  exceptions  -- 
possibly  Hinds  (1960)  error-rate  data  for  punched-card  data  entry, 
for  example  --  the  tasks  for  which  such  data  are  available  appear 
to  be  too  specialized,  or  too  vague,  to  provide  useful  information 
for  the  designer  in  raw-data  form.  This  probably  results  in  part 
from  the  absence  of  an  adequate  taxonomy  of  user  tasks.  If  such  a 
taxonomy  existed,  error  data  (and  other  performance  data)  expressed 
in  terms  of  such  tasks  might  be  directly  applicable. 

There  are  two  very  good  —  and  very  different  —  summaries 
which  deal  with  data  entry  errors  at  a  level  which  might  be  useful 
to  the  designer.  Seibel's  (1972)  review  is  not  explicitly  about 
errors,  but  discusses  alternative  data  entry  devices  and  procedures 
in  terms  of  their  effect  on  speed  and  errors,  and  is  probably  the 
best  existing  integration  of  the  information  available  on  alter¬ 
native  encoding  techniques,  transcription,  typing,  etc. 

Bailey's  tutorial  presentation  (Bell  Telephone  Laboratories, 
1973)  is  an  attempt  to  educate  the  system  designer  with  respect  to 
the  nature  and  causes  of  user  errors  in  data  entry  systems,  with  an 
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emphasis  on  the  corrective  measures  available  to  the  designer.  The 
paper  is  highly  readable,  but  this  approach  is  necessarily  rather 
verbose.  A  tutorial  presentation  of  this  sort  is  probably  capable 
of  exerting  more  influence  on  the  system  designer  who  has  no  human 
factors  experience  than  is  either  of  the  previously  discussed  pre¬ 
sentations. 

Given  that  error  data  do  not  yet  exist  for  most  user  tasks 
in  a  form  useable  by  the  designer,  procedures  for  error  analysis 
during  system  design  may  be  the  most  useful  information  which  can  be 
provided.  Nawrocki:  et  al  (1973)  describe  a  "qualitative"  approach 
to  error  analysis.  A  procedure  is  described  for  categorizing  errors, 
relating  them  to  the  components  and  processes  in  which  they  occur, 
and  determining  their  causes.  The  case  studies  by  which  the  technique  is 
illustrated  all  involve  data  entry,  but  the  technique  appears  more 
broadly  applicable.  Bailey  (1972)  describes  a  procedure  in  which  a 
very  detailed  simulation  of  a  system  is  conducted,  after  design  and 
preparation  of  complete  user  documentation,  training  materials,  etc., 
but  before  implementation.  The  emphasis  of  the  simulation  is  on  analy¬ 
sis  of  error  rates  and  causes,  and  the  technique  is  undoubtedly 
effective  for  this  purpose.  The  expense,  and  particularly  the  increased 
span  time  required  by  this  approach,  may  be  considered  prohibitive  by 
many. 

Clearly, conceptual  errors  by  the  user  are  much  more  difficult  to 
deal  with  than  are  simple  errors  of  encoding,  transcription,  etc.  If 
the  designer  is  aware  of  the  nature  of  errors  of  the  latter  type,  it 
is  often  possible  to  detect,  and  perhaps  even  to  correct,  the  errors 
by  automated  means  (see  Obermayer,  1977  and  Wasserman,  1973  for  two 
different  applications  of  automated  error  detection,  reporting,  and 
correction).  Even  in  the  case  of  data  entry,  though,  many  of  the 
errors  are  conceptual,  involving,  for  example,  erroneous  descriptions 
of  events.  Strub  (1975)  found  that  over  80%  of  the  errors  observed 
in  a  tactical  data  entry  task  were  not  detectable  by  existing  auto¬ 
mated  techniques.  Artif icial-intel ligence  techniques  may  eventually 
be  very  helpful  here,  but  are  presently  useful  only  in  highly  restricted 
task  domains. 


Individual  Differences 


Individual  differences  among  users,  involving  such  variables 
as  abilities,  acquired  skills,  general  background,  sex,  and  age,  may 
affect  performance  in  tasks  involving  interactive  systems,  and  may, 
therefore,  have  implications  for  system  design.  Various  researchers 
have  reported  that  performance  correlates  with  age,  attitude  measures, 
and  mechanical  aptitude  (Weltman  et  al ,  1973),  sex  and  typing  skill 
(Earl  &  Goff,  1965),  vocabulary  test  performance  (Morrill  et  al ,  1968), 
recency  of  training  and  class  standing  (Robins  et  al,  1974),  and  a 
wide  variety  of  other  variables.  Others  have  speculated  that  perform¬ 
ance  relates  to  cognitive  decision  style  (Strub  &  Levit,  1974;  Levit 
et  al ,  1975)  and  general  intelligence  (Carlisle,  1974),  but  have  not 
observed  such  performance  differences  in  subsequent  empirical  studies. 
Responses  to  certain  system  properties  ("interface  complexity",  Carlisle, 
1974;  "interface  flexibility",  Walther,  1973)  have  been  found  to  vary 
with  user  properties. 

The  principal  difficulty  with  making  use  of  such  data  is  that 
the  observed  effects  are  often  highly  task-specific.  Most  of  the  reports 
of  individual  difference  effects  are  the  results  of  investigations  of 
the  properties  or  performance  of  particular  systems.  Most  of  these 
studies  have  not  been  intended  as  explicit  investigations  of  individual 
differences,  and  few,  if  any,  of  them  have  involved  any  systematic 
variation  of  the  task  performed.  Yet  any  attempt  to  integrate  their 
results  leads  one  immediately  to  the  conclusion  that  the  relevant  in¬ 
dividual  difference  variables  are  strongly  dependent  on  the  particular 
task  performed.  Sometimes,  the  individual  difference  variables  are 
even  further  confounded,  as  when  female  clerk- typists  are  found  to 
perform  better  than  male  engineers  on  a  data  entry  task  involving  typing. 

Theologus  et  al  (1970)  have  studied  the  specific  ability  require¬ 
ments  of  a  number  of  jobs  (not  specific  tasks).  The  results  may  be 
useful  for  personnel  selection,  but  are  not  sufficiently  detailed,  in 
terms  of  tasks,  to  be  useful  to  the  designer,  nor  are  they  specific  to 
computer-system  related  jobs.  This  study  may,  however,  provide  some 
insight  into  task-ability  relationships. 


Given  the  sparcity  of  relevant  data  relating  individual 
differences  to  system  design  considerations,  it  would  appear  that 
nothing  quantitatively  definitive  can  currently  be  written  for 
the  designer  on  this  topic.  However,  Wylie  et  al  (1975)  have  pro¬ 
vided  a  good  tutorial  discussion  on  the  causes  and  effects  of  in¬ 
dividual  differences,  particularly  as  they  affect  the  design  process. 
This  discussion  is  written  so  as  to  be  useful  for  the  designer,  and 
may  be  the  best  we  can  do  at  present. 


TASK  PROPERTIES 

There  are  a  very  large  number  of  tasks  which  may  be  per¬ 
formed  by  the  users  of  interactive  computer  systems.  Some  of 
these  tasks  would  exist  independently  of  the  computer  system,  and 
are  supported  in  some  way  by  the  system,  while  others  (logging 
in,  for  example)  are  by-products  of  the  computer  system  itself. 

Human  performance  varies  strongly  with  task  properties.  Appropri¬ 
ate  automated  aids,  dialogue  methods,  etc.,  clearly  vary  with  the 
task.  Research  results  on  human  behavior  often  fail  to  generalize 
to  other  tasks,  .ven  when  those  tasks  are  superficially  similar  to 
the  experimental  task.  While  attempts  have  been  made  to  develop 
task  taxonomies,  few  existing  taxonomies  address  user  tasks  at  a 
level  which  is  useful  in  the  functional  design  of  interactive 
systems. 

Bennett  (1971)  suggests  that  a  "task  taxonomy  requires  at 
least  the  following  levels  of  units:  a  'job',  perhaps  definable 
as  a  fu'H-time  paid  position;  a  'function,1  a  subdivision  of  a 
job,  or  a  grouping  of  tasks  (such  as  the  research  portion  of  a  pro¬ 
fessor's  job);  a  'task',  an  undefined  but  fairly  specific  set  of 
activities  (for  example,  'calibrate  the  equipment'),  and  a  'subtask' 
a  subdivision  of  a  task  (e.g.,  'adjust  the  knob')".  Using  this 
terminology,  most  of  the  work  which  has  been  done  has  been  at  the 
"function-  or  even  "job"  level,  while  meaningful  discussions  of 
human  performance  must  take  place  at  the  "subtask"  or  "task"  level. 

Consider  "reading",  for  example.  For  many  purposes,  this 
would  be  considered  a  very  low-level  user  task.  And  yet  the  user 
may  be  reading  continuous  text  from  the  display  for  comprehension, 
or  reading  continuous  text  to  detect  errors,  or  searching  a  menu  for 
the  desired  comnand,  to  cite  a  few  alternatives.  There  is  evidence 
that  these  specific  tasks  may  vary  in  their  requirements  for  minimum 
display  resolution,  line  spacing,  use  of  upper  and  lower  case  charac¬ 
ters,  and  perhaps  other  variables. 
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Obviously,  the  development  of  a  taxonomy  of  user  tasks  down 
to  this  level  of  detail  would  be  quite  difficult.  It  may  only  be 
practical  to  attempt  such  taxonomization  within  sharply  delimited 
application'areas,  such  as  text  processing,  message  handling,  or 
surveillance.  However,  if  guidelines  are  to  exist  which  provide 
quantitative  data  concerning  human  performance,  and  if  such  data 
are  to  be  applied  by  the  designer,  some  common  understanding  of 
user  tasks  is  absolutely  necessary. 

Of  the  papers  surveyed,  several  contained  explicit  attempts 
to  taxonomize  user  tasks  at  one  level  or  another.  At  the  global  end, 
Theologus  et  al  (1970)  analyzed  jobs  in  terms  of  their  requirement 
for  specific  known  human  abilities.  This  paper  provides  some  useful 
insights,  but  is  strictly  at  the  "job"  level  (e.g.,  computer  programmer). 
R.  B.  Miller  (1965,  1969)  discusses  very  general  problem-solving  tasks, 
along  with  the  relationship  between  tasks  and  information  requirements. 
This  task  characterization  is  not  specific  enough  for  the  purposes 
outlined  here,  but  the  later  paper,  in  particular,  contains  a  good 
discussion  of  the  relevance  of  a  task  taxonomy  to  system  design. 

Nicholson  et  al  (1972)  attempt  to  taxonomize  user  tasks  in  Navy  inter¬ 
active  computer  systems.  Their  taxonomy  contains  general  interaction 
operations,  at  a  level  which  is  somewhere  between  "task"  and  "function", 
and  refers  to  some  problem-specific  tasks,  as  well.  This  would  appear 
to  be  a  good  way  to  start,  if  the  goal  is  a  general  user  task  taxonomy. 
Unfortunately,  the  result  doesn't  quite  reach  a  level  of  detail  ade¬ 
quate  for  meaningful  discussions  of  human  performance. 

Wylie  et  al  (1975)  consider  only  functions  performed  in  connect¬ 
ion  with  automated  surveillance  systems.  The  result  is  a  taxonomy  which 
appears  to  be  very  good,  and  achieves  sufficient  detail  to  support  a 
discussion  of  human  performance  and  automated  aids.  While  an  even 
more  detailed  task  breakdown  would  be  desirable,  this  study  appears 
to  be  a  notable  success  in  an  area  which  has  seen  few  really  useful 
products.  It  is  important  to  note  that  the  tasks  discussed  here  (e.g., 
"alerted  searching  for  undetected  lower  order  features")  are  very 


specific  to  surveillance,  and  have  little  in  common  with  the  same- 
level  tasks  involved  in,  for  example,  text  processing.  Restricting 
the  scope  of  inquiry  to  a  single  application  area  may  be  the  key  to 
success  in  task  taxonomization. 

There  is  a  second  advantage  to  the  development  of  separate 
task  taxonomies  for  specific  application  areas.  If  tasks  are  identi¬ 
fied  in  a  very  general  way,  the  burden  of  relating  them  to  the  require 
ments  of  a  specific  system  falls  —  perhaps  somewhat  heavily  --  on  the 
designer.  If  the  tasks  are  characterized  in  terms  of  a  particular 
application  area  or  process,  the  designer  may  find  it  much  easier  to 
recognize  and  relate  to  them.  Thus,  the  taxonomy  may  be  more  easily 
used,  and  a  need  for  formal  task  analyses,  by  the  designer,  may  even 
be  eliminated. 

In  summary,  our  lack  of  success  --  some  might  say  lack  of 
effort  —  in  characterizing  user  tasks  is  probably  the  greatest  exist¬ 
ing  deficiency  of  the  field.  There  is  some  indication  that  this  is 
a  manageable  problem,  however,  if  approached  with  a  limited  applica¬ 
tion  area  in  mind.  In  any  case,  progress  in  this  area  is  necessary 
if  we  are  to  provide  detailed,  quantitative  guidelines  which  allow 
the  designer  to  confidently  apply  the  results  of  human  factors  re¬ 
search  to  the  identification  of  appropriate  automated  aids  and  their 
detailed  specification. 
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DETERMINATION  OF  USER  REQUIREMENTS 
REQUIREMENTS  ANALYSIS  METHODS 

Specific  research  results,  comparing  input  devices  or  display 
features,  for  example,  represent  an  important  kind  of  contribution 
which  human  factors  practitioners  can  make  to  the  system  design  process. 

Before  such  data  can  be  applied,  however,  it  is  necessary  that  the 
designer  have  a  basic  conception  of  the  functional  requirements  of  the 
system.  The  previous  section  has  indicated  that  some  of  these  require¬ 
ments  may  be  suggested  by  an  isolated  consideration  of  the  properties 
of  the  task,  or  of  the  user.  All  too  often,  user  requirements  analysis 
is  no  more  detailed  or  formal  than  this.  Commonly,  the  designer  un¬ 
critically  accepts  the  user's  stated  preferences,  often  obtained  via 
interview,  integrates  that  information  with  the  designer's  own  prior 
conception  of  the  user's  task  and  work  habits,  and  proceeds  to  design 
the  system.  This  is  a  particularly  prevalent  scenario  in  those  cases 
in  which  the  designers  are  themselves  expert  in  the  application  domain 
of  the  system.  This  approach  can  produce  good  systems.  Its  success  is 
limited,  however,  by  the  validity  and  completeness  of  the  designer's 
conception  of  the  task,  the  validity  and  completeness  of  the  user's  view 
of  the  task,  the  ability  of  users  to  verbalize  their  knowledge,  and 
a  wide  variety  of  subjective  biases  which  may  influence  the  behavior  of 
the  user  or  the  designer. 

It  would  be  presumptuous  to  suggest  that  there  are  any  guaranteed  solu¬ 
tions  to  this  problem.  On  the  other  hand,  the  observation,  description, 
modeling,  and  improvement  of  human  performance  is  what  behavioral  re¬ 
search  is  all  about.  In  a  loose  sense,  user  requirements  analysis  is 
itself  a  behavioral  research  problem.  Many  of  the  observation  and 
analysis  techniques  developed  by  behavioral  scientists  for  use  in  basic 
research  have  been  effectively  adapted  and  applied  by  human  factors 
personnel  to  the  analysis  of  user  requirements  for  person-machine  systems. 

The  computer  systems  design  literature  contains  scattered  references  to 
these  methods,  but  suggests  that  few  designers  have  any  broad  awareness 
of  these  techniques,  or  of  their  strengths  and  pitfalls.  It  appears 
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appropriate,  then,  for  a  human  factors  design  guide  to  provide  metho¬ 
dological  assistance  to  the  designer. 

The  use  of  appropriate  requirements  analysis  techniques  can 
not  only  result  in  more  accurate  identification  of  user  requirements, 
but  can  also  serve  to  increase  the  involvement  of  the  user  in  the  de¬ 
sign  process.  While  this  can  improve  the  design  itself  (Eason,  1976, 
1977;  Kriebel,  1970;  cf.  Greenwald-Katz,  1976),  there  is  also  evidence 
that  it  results  in  greater  user  acceptance  of  the  resulting  system 
(Eason,  1977;  Igersheim,  1976).  Eason  (1977)  suggests  that  active 
involvement  of  users  in  the  design  process  can  also  result  in  delays, 
and  difficulty  in  translating  stated  user  wants  into  precise  specifi¬ 
cations,  which  can  in  turn  be  somewhat  frustrating  to  the  designer. 

It  is  not  clear,  however,  that  user  involvement  in  the  design  itself  is 
necessary  to  achieve  the  beneficial  effects.  It  seems  likely  that 
greater  designer-user  interaction  in  the  requirements  data  collection 
phase  is  a  desirable  compromise. 

Data  Collection 

Data  collection  is  the  first  step  of  the  user  requirements 
analysis  process.  Basic  approaches  to  data  collection  include  inter¬ 
views,  questionnaires  and  surveys,  field  observations,  simulation, 
and  gaming.  Figures  5,  6,  and  7  present  a  general  overview  of  some 
of  these  techniques,  with  comments  and  references.  In  most  cases, 
the  references  involve  applications  of  the  techniques  in  the  specific 
area  of  interactive  computer  systems  design.  In  a  few  cases,  however, 
the  reference  is  to  a  general  discussion  of  the  method,  since  no  de¬ 
tailed  discussion  of  the  technique  in  connection  with  interactive  com¬ 
puter  systems  was  found.  These  instances  are  identified  in  the 
"comments"  section  of  the  figures. 

The  computer  science  literature  contains  a  large  number  of 
papers  dealing  with  requirements  analysis.  In  general,  those  papers 
emphasize  the  kinds  of  information  the  designer  should  look  for  (e.g., 
mission,  environment,  organizational  structure,  decision  maker  style). 


but  often  at  a  relatively  high  level  (see,  for  example,  the  review 
of  management  information  requirements  analysis  techniques  by  Taggart 
and  Tharp,  1977;  see  also  Evans,  1967).  This  literature  provides 
relatively  little  discussion  about  how  the  designer  should  obtain  this 
information.  Most  of  the  emphasis  is  on  interviews  and  questionnaires, 
with  limited  discussion  of  more  objective,  empirical,  or  performance- 
based  techniques  (for  a  basic  discussion  of  interview  and  questionnaire 
techniques,  see  Parker,  1970). 

Some  specific  types  of  questionnaire  and  survey  methods  are 
discussed  in  Figure  5.  While  these  methods  can  be  useful  for  determin¬ 
ing  user  information  requirements,  their  uncritical  use  is  potentially 
hazardous.  The  principal  difficulty  is  the  subjective  nature  of  ques¬ 
tionnaires.  The  quality  of  the  result  is  necessarily  limited  by  the 
validity  and  completeness  of  the  users'  understanding  of  their  own 
behavior,  and  by  their  ability  to  verbalize  that  information.  Although 
the  users  may  be  quite  expert  at  performing  their  job,  they  are  rarely 
expert  at  analyzing  or  describing  how  they  do  the  job,  its  information 
requirements,  etc.  Even  an  unusually  insightful  and  verbally  fluent 
user  may  produce  a  misleading  characterization  of  a  task's  information 
requirements  because  of  subjective  biases.  Indeed,  such  biases  may 
easily  result  from  the  very  experiential  process  by  which  the  user 
acquires  expertise.  For  example,  often-repeated  procedures  may  be¬ 
come  "automatized"  so  that  they  cease  to  require  constant  attention. 
After  a  time,  the  user  may  be  completely  unaware  of  these  processes 
and  the  information  required  to  initiate  them.  In  some  cases,  the 
most  important  elements  of  the  user's  behavior  may  be  the  very  ones 
which  the  user  is  least  able  to  verbalize  without  assistance.  Further- 
more^urveys  of  user  requirements  may  be  biased  by  the  users'  pre¬ 
conceptions  concerning  the  capabilities  and  current  uses  of  computer 
systems. 

These  problems  are  alleviated  only  slightly  through  the  use  of 
interviews  (discussed  in  Figure  6).  Even  highly  structured  interviews 
are  heavily  dependent  on  the  ability  of  the  user  to  describe  the  task, 
information  requirements,  etc.  While  interviews  with  users  are  a  good 
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Figure  5.  Requirements  Analysis:  Quesionnalre  and  Survey  Methods 
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Figure  6.  Requirements  Analysis:  Interviews  and  Field  Observation 


source  of  preliminary  information,  they  provide  only  part  of  the  story. 
One  beneficial  function  of  a  design  guide  might  be  the  familiarization 
of  the  designer  with  more  objective  requirements  analysis  methods  based 
on  field  observations,  simulation  and  gaming. 

Direct  observation  of  user  behavior  during  actual  job  perform¬ 
ance  is  the  simplest  and  most  basic  of  these  approaches.  Standard  job 
analysis  techniques,  such  as  task  analysis,  link  analysis,  and  activity 
analysis,  can  provide  structure  to  such  field  observation  and  suggest 
what  to  look  for.  These  techniques  are  discussed  in  standard  human 
factors  texts,  but  are  rarely  described,  in  any  detail,  in  documents 
dealing  specifically  with  computer  systems  design.  With  appropriate 
instruction,  system  designers  could  employ  at  least  some  of  these  tech¬ 
niques  effectively. 

There  are  some  limitations  to  these  techniques,  however.  First, 
they  involve  observation  of  the  user's  existing  work  practices,  which 
may  not  correspond  to  those  involved  in  the  use  of  the  proposed  com¬ 
puter  aid.  It  is  easy  to  fall  into  the  trap  of  designing  a  computer 
system  which  enforces  current  --  and  perhaps  inappropriate  --  manual 
practices.  Second,  the  techniques  are  most  suited  for  observation  and 
analysis  of  manual  or  clerical  tasks,  in  which  the  relevant  user  be¬ 
havior  js  overtly  observable.  Cognitive,  or  problem-solving,  tasks 
are  difficult  to  interpret  validly  by  simple  observation. 

On  the  surface,  task  analysis  appears  to  be  a  simple,  descript¬ 
ive  technique.  To  be  useful,  however,  task  analyses  must  describe 
behavior  in  terms  of  units  of  activity  which  are  psychologically  mean¬ 
ingful,  with  regard  to  both  level  of  detail  and  type.  This  implies  that 
sensitivity  to,  and  probably  training  in,  behavioral  science  is  a  pre¬ 
requisite  to  effective  use  of  task  analysis.  This  prerequisite  is  often 
overlooked,  and  this  failing  may  account  for  many  unsatisfactory  appli¬ 
cations  of  the  technique.  It  is  not  clear  that  a  design  guide  can  con¬ 
vey  enough  information  to  allow  a  "psychologically  naive"  system  de¬ 
signer  to  employ  this  approach  well.  If  task  analysis  is  described  along 
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with  appropriate  caveats,  however,  it  might  well  be  emDloyed  more 
effectively  than  at  present,  and  the  interested  designer  might  be 
made  aware  of  a  need  for  additional  training.  It  should  be  noted 
that  a  design  guide  emphasizing  a  single  class  of  system  might  deal 
with  this  problem  by  presenting  a  description  of  candidate  tasks 
known  to  occur  in  such  systems,  provided  appropriate  task-analytic 
research  has  been  done. 

Unlike  field  observation  techniques,  simulation  and  gaming 
approaches s (Figure  7)  allow  the  designer  to  investigate  user  be¬ 
havior  in  novel  situations.  As  we  learn  more  about  automated  problem¬ 
solving  and  decision-making  aids,  it  is  becoming  apparent  that  optimal 
aided  approaches  may  differ  drastically  from  the  practices  of  the 
unaided  user.  Simulation  methods  not  only  allow  empirical  investi¬ 
gation  of  user  interaction  with  not-yet-exi stent  automated  aids,  but 
also  provide  a  mechanism  for  the  study  of  user  behavior  and  informa¬ 
tion  requirements  in  critical  situations  which  occur  only  rarely  in 
the  actual  application  environment. 

The  simplest  and  most  basic  simulation  method  is  "paper"  simu¬ 
lation,  in  which  the  user  interacts  with  a  nonexistent  system  --  hand- 
simulated  by  humans  using  whatever  computational  aids  are  available 
and  appropriate  --  while  solving  one  or  more  representative  problems. 

A  particular  advantage  of  this  approach  is  its  compatibility  with  very 
unstructured  simulations.  By  allowing  the  user  to  engage  in  aided 
problem  solving  with  minimal  constraints  on  both  the  dialogue  and 
the  types  of  information  which  can  be  requested,  it  is  often  possible 
to  gain  significant  insight  into  the  user's  natural  approach  to  the 
task  (see  Cross,  1969;  Ramsey,  1975).  A  secondary  benefit  reported 
by  Ramsey  results  from  involving,  in  the  simulation,  personnel  who 
will  eventually  be  involved  in  the  implementation  of  the  system. 

These  personnel  acquire  greater  understanding  of  the  user's  problems, 
as  well  as  possible  implementation  approaches,  and  are  often  more 
receptive  to  human  factors  inputs  after  such  a  simulation. 
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Figure  7.  Requirements  Analysis:  Simulation  and  Gaming 


"Protocol  analysis"  is  a  related  technique  which  has  become 
quite  popular  for  the  development  of  detailed  cognitive  process 
models,  and  is  beginning  to  see  application  in  computer  system  re¬ 
quirements  analysis.  In  this  technique,  the  user  solves  a  simu¬ 
lated,  or  actual,  problem  while  continuously  reporting  (either  orally 
or  in  written  form)  the  steps  in  the  solution,  their  rationale,  etc. 
This  procedure  can  be  highly  obtrusive,  in  two  ways.  First,  the  re¬ 
quirement  to  report  constantly  on  one's  activities  can  interfere 
with  the  ability  to  perform  the  task,  by  causing  the  total  workload 
to  exceed  the  memory  or  processing  resources  of  the  problem  solver. 
The  situation  is  very  similar  to  that  in  which  a  secondary  task  is 
imposed  in  order  to  determine  the  processing  load  associated  with  a 
primary  task.  Second,  the  requirement  to  report  the  nature  and  pur¬ 
pose  of  each  step  tends  to  emphasize  the  use  of  a  systematic  approach 
to  the  problem.  It  may  very  well  cause  the  problem  solver  to 
commit  to  a  particular  strategy,  or  a  particular  "move",  earlier 
than  in  an  unconstrained  setting.  There  is  also  a  danger,  with 
protocol  analysis,  that  the  emphasis  in  interpreting  the  results 
will  be  placed  on  the  problem  solver's  verbal  report  rather  than 
on  the  actual  steps  taken  in  solving  the  problem.  There  is 
experimental  evidence  that  these  do  not  always  correspond  (Best, 
1977).  In  spite  of  these  difficulties,  however,  and  others  men¬ 
tioned  in  Figure  7,  protocol  analysis  provides  much  more  detailed 
information  than  other  techniques,  and  has  been  used  with  reported 
success  in  connection  with  the  design  of  interactive  problem-solving 
aids  and  interactive  languages. 

Interactive  simulation,  in  which  the  user  interacts  directly 
with  a  computer  system  to  solve  simulated  problems,  can  be  a  very 
effective  technique  for  qatherinq  information  about  the  user's 
behavior  and  information  needs.  This  approach  allows  detailed  simu¬ 
lation  of  the  properties  of  the  actual  system  (specific  dialogue, 
response  time,  etc.).  There  are  some  obvious  difficulties,  however. 
Such  a  simulation  can  be  done  only  after  a  fairly  detailed  --  even 


it'  tentative  --  design  has  been  developed.  Thus,  interactive  simu¬ 
lation  is  more  appropriate  for  design  evaluation  and  iteration  than 
for  initial  user  requirements  data  collection.  In  fact,  the  tech¬ 
nique  is  usually  applied  only  after  the  design  is  actually  imple¬ 
mented,  at  least  in  skeleton  form.  The  alternative  —  simulation 
after  the  design  is  complete,  but  before  implementation  —  requires 
either  significant  resources  or  the  use  of  specialized  dialogue 
simulation  tools  which  arc  not  yet  widespread  (see  Lenorovitz  & 

Ramsey,  1977).  There  are  many  reports  of  beneficial  uses  of  inter¬ 
active  simulation,  however,  (Parsons,  1972,  briefly  discusses  several), 
and  it  should  be  considered  an  important  tool  in  the  overall  design 
process.  A  later  section  discusses  modeling  of  interactive  systems, 
a  topic  closely  related  to  interactive  simulation.  Many  of  the  papers 
discussed  there  also  bear  on  this  topic. 

In  summary,  there  are  several  techniques  for  user  requirements 
data  collection  and  analysis  which  may  be  more  objective  and  more 
effective  than  those  in  common  use  in  the  computer  science  community. 
Improved  designer  awareness  of  these  methods,  their  advantages  and 
pitfalls,  and  the  procedures  involved  in  their  use,  might  be  achieved 
by  a  design  guide  of  the  design-process-oriented  type.  It  is  not 
clear  that  a  treatment  of  these  topics  could  be  achieved  as  effect¬ 
ively  with  the  "reference  handbook"  approach,  which  usually  assumes 
that  the  designer  already  has  the  data  on  which  design  decisions  are 
to  be  based. 

Interactive  System  Modeling  Techniques 

Once  the  designer  has  collected  user  and  task  data,  it  is  some¬ 
times  advantageous,  as  the  next  step  in  the  design  process,  to  attempt 
to  model  the  interactive  system  that  is  being  designed.  In  the  ideal 
case,  a  simulation  model  of  the  interactive  system  can  be  constructed. 
This  would  allow  the  designer  to  examine  the  effects  of  alternative 
design  decisions  before  commiting  to  a  firral  design  structure.  At  the 
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very  least,  attempting  to  construct  a  model  helps  ensure  that  the 
designer  has  adequately  analyzed  user  and  task  properties,  since 
these  are  prerequisites  for  modeling  efforts. 

Depending  upon  the  tasks  that  the  interactive  system  is  to 
be  used  for  and  the  type  of  information  that  the  designer  is  attempting 
to  gain,  one  of  several  basic  types  of  models  could  be  used.  These 
approaches  differ  in  the  degree  to  which  they  might  reasonably  be 
employed  by  an  ordinary  system  design  team  to  aid  in  the  design  pro¬ 
cess.  In  particular,  some  of  the  techniques  depend  primarily  upon 
the  adequacy  of  the  task  analysis,  while  others  require  considerable 
modeling  sophistication  and/or  knowledge  of  human  behavior.  An  over¬ 
view  of  the  major  approaches  to  the  modeling  of  interactive  systems 
is  presented  in  Figure  8. 

The  current  literature  on  mode  Is  of  interactive  systems  can  be 
summarized  as  follows.  First,  all  of  the  types  of  models  shown  in 
Figure  8, with  the  exception  of  "models  of  human  information  processing," 
are  directed  toward  specific  task  domains.  Second,  modeling  efforts 
have  been  successful  only  when  the  modeler  has  a  good  understanding  of 
a  task  domain  and  of  user  behavior  in  that  domain.  Third,  models 
have  been  successful  only  in  rather  simple  domains.  Fourth,  no  gener¬ 
ally  applicable  models  exist  and  it  is  very  unlikely  that  such  models 
will  be  developed  in  the  near  future. 

In  the  paragraphs  below,  we  will  consider  each  of  the  major 
approaches  to  modeling  interactive  systems.  Following  this,  we  will 
briefly  consider  how  models  of  the  user  alone,  rather  than  of  the 
entire  user-computer  system,  can  be  used  to  develop  models  of  inter¬ 
active  systems.  Finally,  we  will  consider  issues  associated  with 
validating  models  of  interactive  systems. 

Network  Models 


A  prerequisite  for  developing  a  network  model  is  that  the 
tasks  performed  by  both  the  user  and  the  system  be  describable  in 
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Figure  8.  Major  Approaches  to  the  Modeling  of  Interactive  Systems  (Page  I  of  2). 
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terms  of  predecessor- successor  relationships.  It  is  also  advantageous 
if  the  tasks  are  fairly  independent.  In  this  case,  performance  data 
can  be  collected  on  each  task  individually,  with  little  concern  for 
the  performance  effects  of  interactions  among  tasks.  If  there  are 
interactions  among  tasks,  the  effects  on  performance  on  individual  tasks 
are  generally  very  complex,  not  easily  predicted,  and  affected  oy  a  large 
number  of  situation-specific  and  user-specific  factors.  In  this  case, 
collecting  performance  data  can  involve  a  great  deal  of  controlled,  em¬ 
pirical  work  (cf.,  Norman  and  Bobrow,  1975). 

Existing  network  models  of  interactive  systems  tend  to  be  limited 
to  tasks  that  have  a  linear  ordering  (e.g.,  Baker,  1970).  Although  a 
linear  model  clearly  satisfies  the  requirement  of  having  predecessor- 
successor  relations,  it  assumes  that  there  is  a  single  path  that  must  be 
followed  from  start  to  completion  of  a  problem.  More  complex  types  of 
networks  (e.g.,  hierarchies  or  heterarchies)  could  also  be  used.  Such 
models  more  easily  allow  for  the  fact  that  the  user-computer  interaction 
can  take  a  variety  of  paths. 

An  example  of  a  hierarchical  network  model  is  a  "procedural  net" 
(Sacerdoti,  1975).  A  procedural  net  represents  the  possible  solutions  to  a 
problem  in  levels,  with  each  successive  level  involving  more  detail  than  its 
predecessors.  For  example,  one  procedure  might  specify  "classify  incoming 
messages"  and  other  procedures,  at  lower  levels  in  the  hierarchy,  would  specify 
in  more  detail  how  this  was  to  be  accomplished.  P» -'decessor-successor 
relations  are  defined  between  levels  and  also  within  levels.  At  a  given 
level,  the  procedures  are  linked  together  by  predecessor  and  successor 
relations  that  define  a  partially  ordered  sequence  of  operations  neces¬ 
sary  to  carry  out  some  higher-level  procedure.  Examples  of  procedural 
net  models  that  allow  for  heterarchical,  rather  than  strictly  hierarchical 
relations,  are  presented  in  Brown  and  Burton  (1978)  and  in  Van  Lehn 
and  Brown  (1978). 

Although  more  general  than  strictly  linear  models,  procedural 
net  models  are  more  complex  to  construct  and  have  currently  been  developed 
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for  only  relatively  simple  types  of  tasks  (learning  mechanical  assembly, 
Sacerdoti ;  learning  arithmetic.  Brown  and  Burton). 


Baker's  model  (Baker,  1970;  Siegel  et  al ,  1973)  is  a  particularly 
good  example  of  a  network  model  used  to  investigate  the  properties  of  a 
particular  user-computer  system.  However,  like  any  sophisticated  model, 
it  was  expensive  to  construct.  Furthermore,  by  modeling  at  a  detailed 
level  (a  strength  of  the  model)  it  achieves  very  limited  general izability 
to  other  systems  or  tasks.  Such  difficulties  apply  to  all  modeling  ap¬ 
proaches,  and  should  be  carefully  considered  before  attempting  model 
development. 

Control -Theory  Models 

As  their  name  implies,  these  models  are  best  suited  for  control -type 
tasks.  From  a  user  performance  perspective,  control  generally  does  not 
involve  problem  solving,  in  the  sense  that  the  user  must  actively  search 
for  a  solution  or  resolution  of  the  current  task.  Rather,  this  type  of 
task  generally  involves  fairly  automatic  application  of  well-learned 
algorithms  and  procedures.  In  this  case,  user  performance  variance  is 
extremely  low  and  statistical  estimation  procedures  provide  adequate 
descriptions  of  performance. 

In  control -theory  models,  the  user  is  treated  as  an  element  in 
a  feedback-control  loop.  Because  both  user  and  system  performance  vari¬ 
ances  are  low,  very  good  prediction  of  overall  user-computer  system  per¬ 
formance  can  be  obtained.  A  potential  pitfall  of  such  models  is  that 
treating  the  user  as  an  element  in  a  control  loop  may  lead  the  designer 
to  not  explicitly  focus  on  the  properties  of  the  user-system  interface. 

Decision-Theory  Models 

Decision- theory  models  are  applicable  to  tasks  in  which  the  user 
must  select  from  among  alternative  actions  and  evaluate  the  probable 
effects  of  applying  these  actions.  The  development  of  such  models 
assumes  that  the  criteria  with  respect  to  which  actions  are  to  be  eval- 
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uated  are  well-specified  and  that  a  "correct"  evaluation  of  each  al¬ 
ternative  action  exists.  Although  these  are  serious  restrictions,  and 
limit  the  types  of  tasks  to  which  decision-theory  models  are  applicable, 
the  successful  development  of  such  a  model  provides  the  designer  with 
an  important  advantage.  Decision-theory  models  can,  with  some  modifi¬ 
cation,  be  incorporated  into  an  interactive  system  as  decision  aids. 

A  decision-theory  model  indicates  not  only  what  aids  would  be  useful, 
but  also  indicates  how  they  should  be  implemented  (cf.,  Freedy  et  al , 
1976).  The  designer  should  note,  however,  that  the  task  involved  must 
be  understood  to  a  fairly  fine  level  of  detail  and  this,  as  a  consequence, 
limits  such  models  to  relatively  simple  situations. 

Models  of  Human  Information  Processing 

Although  potentially  the  most  general  model,  this  type  of  model 
is  also  the  most  difficult  to  construct.  The  view  of  a  human  as  an  infor¬ 
mation  processor  indicates,  in  part,  the  influence  of  computer  science 
concepts  on  psychology.  In  such  models,  it  is  not  uncommon  for  the  user's 
processing  abilities  to  be  represented  in  a  manner  similar  to  automata 
theory  or  Turing  machine  descriptions  of  computer  systems. 

The  generality  of  such  models  results  from  the  fact  that  an  ade¬ 
quate  model  of  human  information  processing  could  be  applied  to  any  task 
domain.  Although  there  is  a  very  large  body  of  research  in  this  area, 
current  knowledge  is  still  insufficient  to  supply  all  of  the  detailed 
information  necessary  to  construct  such  a  model.  Such  models  have  been 
developed,  however,  for  well-specified  tasks,  for  which  it  is  not  neces¬ 
sary  to  develop  a  complete  model  of  the  user  as  an  information  processor. 
If  the  designer  can  find  an  existing  model  that  is  applicable  to  the 
current  system  design,  it  may  be  advantageous  to  use  it.  If  no  such 
model  exists,  however,  developing  such  a  model  is  probably  beyond  the 
capabilities  and  constraints  of  most  interactive  system  designers. 
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These  models  are  intended  primarily  to  describe  computer  system 
behavior.  In  general,  they  give  inadequate  attention  to,  or  completely 
ignore,  user  performance  or  the  user-computer  interface.  These  models 
are  most  useful  in  determining  whether  or  not  such  user  requirements 
as  system  response  time  are  satisfied.  They  are  also  useful,  as  models 
of  existing  systems,  for  "fine  tuning"  overall  system  performance  for 
simple,  well-defined  tasks. 

Models  of  the  User  Alone 

In  Figure  8,  we  presented  the  major  approaches  to  the  modeling 
of  interactive  systems.  A  distinction  needs  to  be  made,  however,  be¬ 
tween  models  of  interactive  systems,  as  such,  and  models  of  either  the 
user  or  the  computer  system  in  isolation.  Even  in  a  given  approach  to 
modeling  interactive  systems,  differential  attention  can  be  given  to 
either  the  user  or  the  computer  system. 

Computer  system  models  are  near  one  end  of  this  continuum;  they 
focus  largely  on  the  computer  system  and  give  little  attention  to  the 
user.  The  remaining  approaches  shown  in  Figure  8  were  originally  devel¬ 
oped  as  models  of  human  (and  thus  user)  performance  and  have  gradually 
been  adapted  to  include  computer  system  performance.  That  is,  the 
assumptions  underlying  these  approaches  were  derived  from  research  on 
human  behavior.  The  development  of  such  models  into  models  of  interactive 
systems  is  due,  in  part,  to  the  recognition  that  some  types  of  unaided 
human  behavior  are  also  observed  in  human  behavior  with  interactive 
systems. 

There  are  a  large  number  of  models  of  the  user  alone  that  have 
not  yet  been  adapted  as  models  of  interactive  systems.  Such  models  are 
generally  represented  as  simulation  models  of  human  performance  in  speci¬ 
fic  task  domains.  Since  the  intent  is  to  develop  viable  psychological 
models  of  behavior,  these  models  tend  to  be  extremely  detailed. 
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Many  models  of  the  user  describe  human  performance  in  terms  of 
the  memory  and  process  limitations  of  the  human  information  processor. 
If  the  designer  can  find  an  appropriate  model,  then  the  interactive 
system  may  be  designed  to  overcome,  at  least  partially,  the  effects 
of  these  limitations.  In  general,  however,  the  amount  of  detail  re¬ 
quired  in  such  models  limits  their  development  to  relatively  simple 
tasks,  which  may  be  of  little  interest  to  interactive  system  designers. 
Further,  it  may  take  a  great  deal  of  sophistication,  on  the  part  of 
the  designer,  to  translate  the  psychological  terms  and  concepts  of  such 
models  into  a  form  suitable  to  express  a  system  design. 

Model  Validation 


In  general,  validating  a  model  requires  that  the  model  produce 
predictions  of  system  performance  that  can  be  compared  with  observed 
system  performance.  Since  the  basic  purpose  of  such  models  is  to  pro¬ 
vide  the  designer  with  a  tool  for  exploring  design  alternatives  before 
committing  to  a  design  structure,  however,  such  validation  is  ordinarily 
impossible.  In  this  case,  model  validity  is  a  function  of  the  adequacy 
of  the  user  and  system  data  that  were  collected  and  the  appropriateness 
of  the  chosen  model  for  a  specific  type  of  task.  Without  sufficient 
data  on  user  and  system  requirements,  model  construction  can  be  a  futile 
exercise.  However,  attempting  to  construct  a  model  may  point  out  in¬ 
adequacies  in  the  data  collection  and  analysis  process  and,  as  a  result, 
lead  the  designer  to  consider  requirements  that  were  neglected  before. 

Conclusions 


As  was  indicated  above,  there  are  a  large  number  of  constraints 
on  the  development  of  models  of  interactive  systems.  The  selection  of 
a  modeling  approach  must  be  based  on  the  task  domain  involved.  Further, 
the  designer  must  have  a  good  understanding  of  this  task  domain  and 
currently  successful  models  are  restricted  to  rather  simple  domains. 

As  a  result,  no  generally  applicable  approach  to  developing  models  of 
interactive  systems  exists. 
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It  has  been  suggested  elsewhere  in  this  report  that  restricting 
the  application  domain  of  a  design  guide  might  allow  a  more  explicit 
and  detailed  treatment  of  the  tasks  involved.  This  may  also  allow  a 
more  useful  discussion  of  existing  models  and  modeling  techniques 
applicable  in  the  chosen  application  area.  By  concentrating  on  a 
particular  kind  of  interactive  system,  the  design  guide  might  even  be 
able  to  suggest  a  basic  model  as  a  starting  point. 

Task  Allocation 

In  the  "traditional"  approach  to  the  design  of  person-machine 
systems,  the  next  step  after  requirements  data  collection  is  an  alloca¬ 
tion  of  the  identified  tasks  to  either  the  user  or  the  system.  This 
classical  "task  allocation"  method  is  discussed  very  well  by  Chapanis 
(1965).  It  is  implicitly  assumed,  in  this  method, that  the  individual 
tasks  identified  by  (for  example)  task  analysis  are  highly  independent. 
Thus,  any  one  of  these  tasks  can  be  assigned  to  either  the  user  or  the 
system  with  little  impact  on  the  performance  of  other  tasks.  Typically, 
each  task  is  analyzed  to  determine  which  system  component  is  best  suited 
to  the  task,  and  the  task  is  assigned  accordingly. 

For  some  kinds  of  systems  (e.g.,  manual  control,  simple  clerical 
systems),  this  method  works  reasonably  well,  but  it  has  deficiencies 
which  make  it  inappropriate  for  most  modern  interactive  computer  system 
design  efforts.  The  most  effective  desiqns  are  often  those  which  use 
the  computer  to  assist  and  complement  the  user  in  difficult  tasks,  rather 
than  simply  assigning  the  mechanical  tasks  to  the  computer  (see  Jordan, 
1963,  Vaughan  &  Mavor,  1972).  In  fact,  interactive  aids  often  allow 
the  task  to  be  approached  in  a  way  which  is  impossible  for  the  unaided 
user.  There  is  nothing  in  the  traditional  task-analysis/task-allocation 
method  which  helps  the  designer  recognize  the  need  for  --  or  nature  of  -- 
a  novel  interactive  approach  to  problem  solving.  If  the  method  is  em¬ 
ployed  uncritically,  it  may  simply  lead  to  automation  of  an  existing, 
inferior  approach. 


This  is  a  significant  pitfall  for  the  conscientious  designer 
who  is  not  well  educated  in  human  factors.  Because  this  method  works 
well  in  many  systems  which  do  not  involve  interactive  computer  aids,  it 
is  heavily  emphasized  in  many  standard  human  factors  texts.  The  un¬ 
critical  use  of  such  methods  can  place  the  designer  in  some  peril. 

A  design  guide  should  certainly  discuss  the  use  of  classical 
task  allocation,  but  should  also  outline  the  limitations  of  the  method 
and  help  the  designer  identify  those  circumstances  under  which  it  should 
be  applied. 

Unfortunately,  the  alternatives  available  to  the  designer  are 
more  difficult  to  identify.  If,  as  we  have  suggested,  design  is  pri¬ 
marily  an  analytic-synthetic  process,  the  designer's  experience  is 
necessarily  a  dominant  factor  in  recognizing  the  applicability  of 
particular  types  of  interactive  aids.  A  design  guide  might  assist  this 
process  by  providing  examples  of  different  types  of  aids,  along  with 
suggestions  about  the  types  of  tasks  in  which  they  are  appropriate. 

This  is  discussed  in  the  next  section. 

PROBLEM-SOLVING  AIDS 

In  this  section,  we  present  a  survey  of  the  types  of  aids  that 
have  been  used  for  man-computer  problem  solving  and  some  heuristics, 
or  "rules  of  thumb" for  selecting  an  appropriate  aid  for  a  given  situa¬ 
tion.  Since  the  usefulness  of  any  aid  is  ultimately  determined  by  the 
task  to  which  it  is  applied  and  the  problem-solving  behavior  that  is 
elicited  by  that  task,  we  will  begin  by  describing  common  types  of 
problem-solving  behavior.  Following  this,  we  will  consider  what 
specific  aids  are  likely  to  be  most  useful  for  each  type  of  behavior. 

"Man-computer"  problem  solving  is  defined  as  a  type  of  problem¬ 
solving  activity  that  involves  both  a  human  and  a  computer  component. 
"Human  problem  solving",  of  itself,  can  be  characterized  as  an  "informa- 
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tion  processing"  task.  The  human  has  a  set  of  cognitive  abilities 
and  mechanisms  that  can  be  applied  to  a  variety  of  tasks.  The 
extreme  versatility  that  can  be  observed  in  human  problem  solving 
is  due  to  the  fact  that  these  mechanisms  and  abilities  can  be  modi¬ 
fied,  within  limits,  and  ordered  in  different  ways.  Although  versa¬ 
tile  and  adaptive,  the  human  problem  solver  also  is  subject  to  fairly 
severe  memory  and  processing  limitations.  That  is,  human  memory  may 
be  insufficient  to  deal  with  all  of  the  information  that  is  generated 
by  some  process.  Or  a  process,  although  its  method  of  execution  is  well 
known,  may  require  more  processing  resources  than  the  human  has  avail¬ 
able,  perhaps  because  of  competing  processes.  "Man-computer  problem 
solving"  refers  to  a  situation  in  which  the  computer  is  used  to 
effectively  extend  the  human's  information  processing  resources  and 
reduce  the  effects  of  these  limitations. 

In  this  review,  we  will  consider  a  problem  to  consist  of  a  set 
of  "knowns"  and  a  set  of  "unknowns"  (cf,  Greeno,  1973).  In  a  geo¬ 
metry  problem,  for  example,  the  "knowns"  are  the  initial  situation  and 
the  theorems  and  axioms  that  can  be  applied  to  it, and  the  "unknowns" 
are  the  theorems  and  proofs  to  be  derived.  "Problem  solving"  refers 
to  the  process  of  generating  a  connection  between  the  "knowns"  and 
"unknowns".  Although  this  definition  is  extremely  broad  and  appears 
to  encompass  a  large  number,  if  not  all,  human  activities,  it  is 
important  to  note  that  not  all  tasks  are  considered  to  be  problem¬ 
solving  tasks. 

In  determining  whether  a  given  task  is  or  is  not  a  problem¬ 
solving  task,  it  is  necessary  to  determine  whether  the  connection 
between  the  "knowns"  and  "unknowns"  is  generated  or  recognized.  In 
story  or  scene  understanding,  for  example,  there  are  "knowns"  and 
"unknowns".  In  story  understanding,  we  may  be  presented  with  a  set 
of  characters  ("knowns")  who  are  performing  certain  actions  in  order 
to  achieve  some  goals  ("unknowns").  The  connection,  however,  is  pre¬ 
sented  in  the  story  and  the  reader  need  only  recognize  it;  it  is  not 
generated. 
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Although  story  comprehension  is  perhaps  the  clearest  example 
of  a  task  that  involves  recognition  rather  than  generation,  other  tasks 
also  have  a  predominant  recognition  component.  Examples  include  signal 
detection,  surveillance  and  monitoring  tasks,  or  any  task  in  which  the 
principal  activity  involves  noticing  that  some  clearly  coded  event  occurs 
or  is  present. 

All  of  the  aids  considered  in  this  section  are  concerned  with  tasks 
that  primarily  involve  generation.  Aids  for  recognition  tasks,  which  typi¬ 
cally  involve  display  coding  or  time-compressed  displays,  are  discussed 
elsewhere  in  this  report. 

Ideally,  a  system  designer  should  be  able  to  select  appropriate 
problem-solving  aids  on  the  basis  of  the  problem-solving  task  involved. 

For  example,  if  the  system  is  designed  for  control  tasks  or  logistics 
tasks,  then  specific  aids  could  be  apparent.  Unfortunately,  for  two 
reasons,  this  is  not  the  case.  First,  although  we  can  define  problem 
solving  as  a  generation  task,  it  must  be  noted  that  problem  solving  is 
not  a  unitary  task,  but  typically  involves  a  variety  of  subtasks.  In  a 
given  problem-solving  situation,  one  or  more  of  these  subtasks  is  generally 
more  crucial  than  the  others.  A  description  of  each  of  these  subtasks 
is  presented  in  Figure  9.  The  problem-solving  behavior  that  is  most  appro¬ 
priate  for  a  given  task  is  determined  largely  by  the  subtasks  that  are 
most  important  to  successful  performance  rather  than  by  the  overall  task 
itself. 


Both  the  specific  task  and  the  experience  of  the  user,  therefore, 
affect  the  type  of  problem-solving  behavior  that  will  be  employed.  In 
selecting  problem-solving  aids,  it  is  necessary  for  the  designer  to  con¬ 
sider  the  type  or  types  of  behavior  that  will  be  exhibited.  In  a  separate 
section  of  this  report,  we  described  methods  for  requirements  analysis. 

If  the  designer  is  primarily  concerned  with  a  man-computer  problem-solving 
system,  then  this  analysis  should  include  consideration  of  the  type(s) 
of  problem-solving  behavior  involved. 
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Figure  9  (Concluded). 


For  the  purposes  of  this  review,  we  will  consider  three  dimen¬ 
sions  along  which  behavior  can  be  classified.  Although  this  is  a  sim¬ 
plified  view  of  problem-solving  behavior,  it  seems  to  be  an  appropriate 
level  of  detail.  Finer  classifications  of  behavior,  though  possible, 
would  tend  to  obscure  the  commonalities  between  behavior  in  various 
task  domains  and  the  aids  most  appropriate  for  these  behaviors. 

Abstractions  --  No  abstractions 


Problem  solving  can  be  viewed  as  an  activity  that  takes  place 
within  a  "problem  space"  (cf . ,  Newell  and  Simon,  1972).  A  problem  space 
represents  the  problem  solver's  view  of  the  problem  to  be  solved,  the 
actions  that  can  be  taken  to  search  for  a  solution,  etc.  The  specific 
elements  in  the  problem  space  may  or  may  not  closely  correspond  to 
elements  in  the  environment  in  which  the  solution  will  be  implemented. 

In  problem  solving  by  abstraction,  the  problem  is  reformulated 
into  higher-level,  more  general  terms.  This  is  most  useful  in  large, 
complex  tasks  in  which  the  problem  elements  that  (eventually  need  be 
considered  would  exceed  the  problem  solver's  memory  resources.  Soft¬ 
ware  design  is  an  example  of  a  task  that  involves  abstraction;  a  designer 
initially  works  with  an  abstract  representation  of  the  design,  rather 
than  with  the  algorithms  or  code  that  will  eventually  implement  the 
design.  After  an  abstraction  is  formed,  problem-reduction  heuristics 
are  generally  applied  to  partition  the  original  problem  into  a  series  of 
(hopefully)  independent  subproblems,  which  can  be  dealt  with  at  the 
level  of  detail  required  by  the  final  solution. 

In  problem  solving  with  no  abstractions,  all  actions  that  the 
problem  solver  may  generate  are  directly  related  to  the  problem-solving 
environment.  That  is,  all  actions  could  be  executed  immediately.  This 
criterion  is  useful  in  deciding  whether  a  problem  involves  abstractions 
or  not.  In  problem  solving  by  abstraction,  the  effects  of  considered 
actions  must  be  simulated  by  the  problem  solver,  since  they  cannot  be 
directly  applied. 


In  problem  solving  by  abstraction,  the  primary  subtasks  involved 
are  problem  definition  and  strategy  selection.  Since  abstraction  implies 
that  the  problem  will  be  reformulated,  it  is  important  that  an  appro¬ 
priate  problem  definition  be  used.  Strategy  selection,  in  this  case, 
refers  to  the  choice  of  an  appropriate  problem  reduction.  In  general, 
problem  solving  by  abstraction  is  effective  only  if  the  problem  can  be 
reduced  into  a  series  of  subproblems. 

When  no  abstractions  are  used,  the  focus  is  on  alternative  genera¬ 
tion,  alternative  evaluation,  and  alternative  execution.  Since  the 
actions  that  can  be  taken  will  directly  affect  the  environment,  appro¬ 
priate  actions  must  be  considered  (alternative  generation)  and  the 
effects  of  actions  must  be  known  before  the  action  is  applied  (alterna¬ 
tive  evaluation).  Because  actions  can  directly  affect  the  environment, 
successful  problem  solving  requires  a  successful  interaction  with  the 
environment,  which  is  determined  by  the  alternative  execution  subtask. 

Search  --  No  search 


Search  behavior  implies  that  a  solution  to  the  current  problem 
is  sought  by  searching  through  a  set  of  possible  solutions.  This  type 
of  behavior  generally  involves  the  application  of  inference  rules  or 
heuristics  to  newly  presented  or  generated  information  in  order  to  test 
whether  a  solution  is  achieved.  Although  the  problem  solver  may  know 
all  possible  solutions,  it  is  more  likely  that  only  the  inference  rules 
that  test  for  a  solution  (and  conceivably  could  generate  all  possible 
solutions)  are  known.  No-search  behavior  implies  that  previously 
developed  solutions  (or  parts  of  solutions)  are  used  in  problem  solving. 
The  problem  solver  recognizes  that  the  problem  (or  part  of  the  problem) 
has  been  solved  before  and  7.n  appropriate  solution  is  retrieved.  It 
may  frequently  be  the  case  that  the  retrieved  solution  needs  some  modi¬ 
fication  before  it  can  be  applied,  but  it  is,  in  most  important  respects, 
at  least  an  appnpriate  "outline"  for  the  required  solution. 


Search  behavior  primarily  involves  three  subtasks.  Problem 
definition  leads  to  the  selection  of  inference  rules  and  determines 
the  possible  solutions  that  will  be  considered.  The  strategy  selec¬ 
tion  subtask  determines  how  the  selected  rules  will  be  applied.  Similar¬ 
ly,  the  generation  of  feasible  solutions  can  be  considered  as  alterna¬ 
tive  generation. 

In  no-search  behavior,  the  primary  focus  is  on  alternative  evalua¬ 
tion  and  alternative  execution.  Alternative  evaluation  refers  to  the 
need  to  evaluate  the  appropriateness  of  the  alternative,  retrieved 
solutions,  and  alternative  execution  is  concerned  with  mapping  the  re¬ 
trieved  solution  onto  the  problem-solving  environment. 

Design  tasks  are  examples  of  situations  in  which  no-search  behavior 
is  frequently  observed.  In  engineering  design,  for  example,  an  experi¬ 
enced  designer  is  well  aware  of  the  possible  structures  that  represent 
valid  designs,  and  is  generally  able  to  retrieve  a  previously  developed 
design  and  modify  it  to  fit  the  current  problem.  In  tasks  in  which  there 
is  not  a  great  deal  of  commonality  or  similarity  among  problem  solutions, 
search  behavior  is  typically  observed.  Scheduling  or  resource  allocation 
in  highly  dynamic  environments  are  examples  of  such  tasks. 

Data-driven  --  Conceptually-driven 


Data-driven  behavior  is  guided  by  particular  aspects  of  the  task 
domain.  That  is,  certain  domain-dependent  aspects  of  the  problem  sug¬ 
gest  "crucial"  steps  in  problem  solving  or  suggest  necessary  problem¬ 
solving  processes.  Command  and  control  and  medical  diagnosis  are  examples 
of  such  tasks.  Conceptually-driven  behavior,  on  the  other  hand,  is  guided 
by  the  problem  solver's  previous  experiences  with  similar  domains.  Most 
design  tasks  are  in  this  category. 

In  data-driven  behavior,  problem  recognition  and  alternative 
evaluation  are  the  most  crucial  problem-solving  subtasks.  Problem 
recognition  is  important  since  salient,  relevant  aspects  of  the  environ- 


merit  must  be  recognized  and  reacted  to.  Alternative  evaluation  is 
important  since  this  type  of  behavior  focuses  on  detailed  aspects 
of  the  environment  and  the  effects  of  actions  at  a  low  level  of  de¬ 
tail  must  be  well  understood. 

In  conceptually-driven  behavior,  the  primary  subtasks  are  problem 
definition,  goal  definition,  and  strategy  selection.  Since  conceptually 
driven  behavior  is  guided  by  previous  experiences,  the  problem  defini¬ 
tion  subtask  should  result  in  the  problem  being  formulated  so  that 
previous  relevant  experiences  will  be  retrieved.  Since  these  retrieved 
experiences  are  frequently  goal-oriented  —  that  is,  selected  on  the 
basis  of  the  goal  that  is  desired  --  goal  definition  is  crucial.  In 
addition,  the  goal  definition  subtask  should  ensure  that  the  selected 
goal  is  both  appropriate  and  attainable.  Strategy  selection  determines 
how  the  retrieved  information  will  be  applied. 

Types  of  problem  solving  aids  that  have  been  proposed  and  deve¬ 
loped,  to  varying  degrees  of  detail,  are  described  in  Figure  10.  The 
reader  should  note  that  only  general  descriptions  of  aids  are  provided 
and  the  exact  form  that  an  aid  will  take  upon  implementation  may  depend 
on  the  specific  task  domain  and  the  type  of  behavior  involved.  For 
example,  strategy  selection  for  problem  solving  involving  abstraction 
may  suggest  problem-reduction  heuristics,  but  if  search  behavior  is 
involved,  an  interactive  aid  might  suggest  which  inference  rules  to 
apply. 

Figure  11  indicates  which  aids  are  most  likely  to  improve  per¬ 
formance  as  a  function  of  the  type  of  problem-solving  behavior  involved. 
Only  those  aids  that  are  considered  to  be  most  useful  for  a  given  type 
of  behavior  are  indicated.  Aids  other  than  those  indicated  could  also 
be  used,  but  their  effects  would  be  expected  to  be,  in  most  cases,  less 
significant.  Some  of  the  aids  listed  in  Figure  10  are  not  associated 
with  a  particular  behavior  type.  The  reader  is  referred  to  Figure  10 
for  comments  on  the  use  of  these  aids. 
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Figure  10.  Types  of  Problem-solving  Aids  (Page  1  of  2). 
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The  greatest  difficulty  involved  in  making  our  knowledge  of  problem¬ 
solving  aids  useful  to  the  designer  is  the  relatively  abstract  level  of  that 
knowledge.  If  design  guidelines  were  to  be  written  for  a  very  limited  class  of 
systems,  it  might  be  possible  to  suggest  very  specific  aids.  For  more  general 
guidelines,  it  would  be  necessary  to  greatly  simplify  the  language  in  which 
problem  solving  has  just  been  discussed,  and  provide  guidance  to  help  the  de¬ 
signer  recognize  the  kinds  of  problem-solving  behavior  involved  in  the  parti¬ 
cular  tasks  at  hand.  Furthermore,  it  would  probably  be  necessary  to  provide 
very  explicit  examples  of  the  various  types  of  aids  which  are  known.  This  all 
appears  feasible,  but  is  by  no  means  easy,  and  we  are  unaware  of  any  such  guide¬ 
lines  which  have  been  developed  in  the  past.  If  designers  operate  primarily  by 
synthesizing  designs  from  already  known  elements,  such  a  presentation  might 
greatly  enhance  the  designer's  utilization  of  aiding  techniques  which  happen  to 
lie  outside  the  designer’s  past  experience. 


INTERACTIVE  DIALOGUE 


After  the  designer  has  settled  upon  a  basic  set  of  funct  ons 
to  be  performed  by  an  interactive  system,  the  next  major  step  is  the 
determination  of  the  nature  of  the  interactive  dialogue.  There  are 
a  number  of  fundamentally  different  types  of  dialogue,  and  a  very 
large  number  of  properties  which  an  interactive  dialogue  may  exhibit. 

A  systematic  treatment  of  this  topic  seems  to  require  that  the  basic 
properties  and  types  of  dialogue  be  discussed  independently  of  such 
factors  as  the  details  of  the  command  language,  display  formatting, 
selection  of  display  and  input  devices,  etc.  A  separate  discussion 
of  basic  issues  is  particularly  appropriate  since  there  is  much  less 
explicit  guidance  available,  in  the  human  factors  literature,  with 
respect  to  selection  of  a  dialogue  type  than  with  respect  to  the  de¬ 
tailed  design  of  the  dialogue  after  the  type  is  determined. 

This  section  discusses  interactive  dialogue  in  general,  and 
provides  a  more  detailed  treatment  of  several  common  dialogue  types. 
Questions  of  display  formatting,  display  coding,  and  display  and  input 
device  selection  are  reserved  for  later  sections. 

BASIC  DIALOGUE  PROPERTIES 

Prior  to  a  discussion  of  various  dialogue  types,  it  is  appro¬ 
priate  to  consider  a  few  basic  properties  on  which  interactive  dia¬ 
logues  may  differ.  Figure  12  contains  a  summary  discussion  of  some  of 
the  most  important  properties.  Many  other  aspects  of  interactive  dia¬ 
logue  could  be  included  here,  but  the  listed  properties  are  particularly 
important,  apply  to  all  dialogue  types,  and  have  at  least  some  empiri¬ 
cal  data  which  might  be  of  use  to  the  designer. 

t 

Initiative 


"Initiative"  is  probably  the  most  basic  interactive  dialogue 
property.  In  computer-initiated  dialogue,  the  user  inputs  commands. 
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Figure  12.  Some  Major  Properties  of  Interactive  Dialogues  (Page  1  of  2). 
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parameter  values,  etc.,  in  response  to  some  form  of  prompting  by 
the  computer,  such  as  a  question  or  a  list  of  alternatives.  In 
user-initiated  dialogue,  no  such  promoting  occurs,  and  the  user  must 
generate  the  necessary  input  from  memory.  "Mixed-initiative"  dia¬ 
logues,  in  which  the  dialogue  may  freely  be  "led"  by  either  the  user 
or  the  computer,  also  occu>"  (e.g.,  Carbonell  &  Collins,  1970),  but 
these  tend  to  be  limited  to  fairly  sophisticated  systems  using  natural 
language  (see  the  later  section  on  natural-language  dialogue).  Variable 
initiative,  in  which  the  user  (or  occasionally  the  system)  selects 
either  user-  or  computer-initiated  dialogue,  is  much  more  common. 

Computer-initiated  dialogue  has  several  advantages.  It  allows 
reliance  on  the  passive  vocabulary  of  the  user  (the  set  of  words  which 
the  user  can  recognize  and  understand),  which  is  typically  much  larger 
than  the  user's  active  vocabulary  (words  which  the  user  can  generate 
and  use  without  prompting).  It  also  allows  the  designer  to  implicitly 
convey  to  the  user  a  "mental  model"  of  the  system's  dialogue  structure. 

IThis  can  be  very  helpful  to  the  new  or  casual  user,  and  may  help  to 

reduce  the  adverse  effects  of  a  satisficing  strategy  (Eason,  1976)  by 
constantly  reminding  the  user  of  the  existence  of  unused  commands. 
Computer-initiated  dialogue  can  also  reduce  the  clerical  workload  of 
the  user  by  allowing  selection,  rather  than  typing,  of  words,  and  can 
even  allow  the  selection  of  nonverbal  command  arguments  in  graphical 
systems.  Common  types  of  computer-initiated  dialogue  include  menu 
selection,  form-filling,  and  question-and-answer  techniques,  all  of 
which  will  be  discussed  in  the  later  section  on  dialogue  types.  Exam¬ 
ples  and  discussion  of  particularly  effective  computer-initiated  dia¬ 
logue  can  be  found  in  Thompson  (1969,  1971)  and  Johnson  (1977). 

Computer-initiated  dialogue  also  has  two  principal  disadvantages. 
First,  such  a  dialogue  can  constrain  the  amount  of  information  conveyed 
in  a  single  transaction,  sometimes  with  detrimental  results.  For  exam¬ 
ple,  McAllister  and  Bell  (1971)  discuss  a  bibliographic  system  in  which 
the  user  potentially  goes  through  84  separate  frames  just  to  order  a 
book.  While  this  may  be  an  appropriate  dialogue  for  the  extreme  novice. 
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it  is  almost  certainly  detrimental  for  the  experienced  user.  Second, 
the  use  of  computer-initiated  dialogue  can  result  in  system-response¬ 
time  delays  which  are  unacceptable,  especially  for  experienced  users. 
This  is  a  function  not  only  of  dialogue  design,  but  also  of  hardware 
and  software  configuration.  For  example,  computer-initiated  light-pen, 
menu-selection  dialogues  for  command  construction  may  be  extremely 
effective  when  controlled  by  a  "smart"  terminal  --  whose  response 
time  may  be  psychologically  negligible  --  and  may  be  totally  disruptive 
when  the  same  dialogue  is  controlled  by  a  host  computer  with  response 
times  of  a  few  seconds.  A  later  section  on  response  time  treats  this 
issue  in  more  detail. 

It  is  important  to  note  that  even  the  use  of  a  rapid,  "smart" 
terminal  does  not,  by  itself,  resolve  the  first  difficulty,  which  is 
a  question  of  dialogue  structure  and  transaction  content.  In  general, 
though,  actions  which  the  user  must  frequently  make  must  either  be  user- 
initiated  or  be  part  of  a  relatively  rapid  interchange,  if  the  dialogue 
is  to  be  satisfactory  to  the  frequent  user.  If  a  computer-initiated 
dialogue  would  cause  the  response-time  guidelines  (discussed  in  the 
later  section  on  response  time)  to  be  exceeded  frequently  by  the  ex¬ 
pected  system  user,  then  user-initiated  dialogue  should  be  provided. 
Often,  the  best  solution  is  a  computer-initiated  default  dialogue,  with 
a  provision  for  the  experienced  user  to  override  this  mode.  Nickerson 
and  Pew  (1971)  present  several  good  suggestions  with  respect  to  such 
dual -mode  dialogues.  For  systems  intended  for  dedicated,  well  trained 
users,  a  purely  user-initiated  dialogue  may  be  satisfactory. 

Flexibility,  Complexity,  and  Power 

Flexibility,  complexity,  and  power  are  related,  but  clearly 
differentiable,  dialogue  properties.  There  is  an  understandable 
tendency  on  the  part  of  designers  to  treat  these  properties  quite 
simpl istical ly:  flexibility  is  good,  complexity  is  bad,  power  is  good. 
The  limited  body  of  empirical  data  bearing  on  these  issues  suggests 


that  this  treatment  is  inappropriate.  It  is  likely  that  each  of  these 
properties  is  related  to  performance  in  a  nonmonotonic  manner,  and 
that  the  performance  effects  also  vary  with  task  and,  especially,  user 
properties. 

Flexibility  (as  defined  in  the  table)  is  difficult  to  measure 
in  an  existing  dialogue  design.  It  is  possible,  though,  to  devise 
dialogues  which  accomplish  the  same  purpose,  but  differ  primarily  on 
this  variable.  Walther  (1973;  see  also  Walther  &  O'Neil,  1974)  varied 
dialogue  flexibility  in  a  study  involving  varying  levels  of  user 
experience.  Highly  flexible  dialogues  were  found  to  help  very  experi¬ 
enced  users,  but  to  degrade  performance  of  moderately  experienced  com¬ 
puter  users  to  a  significant  degree,  especially  by  increasing  error 
rates.  There  is  also  evidence,  from  the  MICA  study  (Eason,  1976; 

Stewart,  1976b),  that  users  adopt  a  satisficing  strategy  with  respect 
to  utilization  of  flexible  dialogues,  and  may  thus  fail  to  take  advan¬ 
tage  of  the  flexibility  provided. 

"Complexity,"  as  the  term  is  used  by  Carlisle  (1974)  is  a  mea¬ 
sure  of  the  number  of  alternatives  from  which  the  user  must  choose  at 
any  given  point  in  the  dialogue.  This  is  not  the  same  as  structural 
complexity;  in  fact,  "complexity"  can  be  reduced  by  imposing  a  tree 
structure  on  the  dialogue,  which  reduces  the  number  of  alternatives 
at  any  point,  but  increases  the  complexity  of  the  dialogue  structure. 
Carlisle  studied  the  effect  of  interface  "complexity"  on  users  differing 
in  experience  and  on  several  cognitive  attributes  (e.g.,  verbal  intelli¬ 
gence).  Although  medium  complexity  (12  options  at  each  node,  as  con¬ 
trasted  with  3  and  21  in  the  other  conditions)  yielded  the  most  inter¬ 
action  and  the  greatest  variety  of  command  usage,  performance  effects 
were  inconsistent.  Complexity  interacted  somewhat  with  user  properties. 

Using  nonsense  syllables  as  "commands,"  Baker  and  Goldstein  (1966) 
demonstrated  that  performance  suffers  when  the  displayed  list  of  alter¬ 
natives  includes  options  not  currently  relevant.  However,  it  is  not 
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clear  that  this  result  is  fully  generalizable  to  dialogue  involving 
meaningful  commands  and  structure.  In  an  unpublished  study,  Boies 
(1977)  compared  a  language  with  a  small  number  of  generic  commands 
and  a  large  number  of  arguments  against  a  language  with  many  specific 
commands  and  few  arguments.  The  generic  organization  was  reportedly 
more  useful. 

Complexity,  then,  is  clearly  a  relevant  variable,  but  no  clear 
guidance  with  respect  to  optimal  complexity  emerges  from  the  surveyed 
literature.  In  passing,  it  might  be  noted  that  some  designers  are  un¬ 
critically  using  Miller's  (1962)  "magical  number  seven  plus  or  minus 
two"  as  a  guideline  for  the  optimal  number  of  dialogue  alternatives. 

It  is  not  clear  that  short-term  memory  capacity  is  the  limiting  factor 
here,  and  this  convention  should  be  considered  arbitrary. 

Dialogue  "power"  is  concerned  with  the  amount  of  work  done  by 
one  command.  Where  the  user  executes  high-level  procedures,  each  of 
which  can  be  specified  as  a  sequence  of  lower-level  procedures,  the 
designer  has  some  choice  of  command  level.  For  example,  the  user  of 
a  mathematical  system  might  accomplish  a  matrix  addition  by  executing 
a  series  of  scalar  additions,  or  a  single  matrix  addition  command  might 
be  provided  (as  in  Wood,  1972).  Clearly,  the  Boies  study,  discussed 
above,  bears  on  this  issue  also.  There  is  no  standard  metric  of  dialogue 
power,  but  Halstead's  (1973)  approach  to  the  measurement  of  programming 
language  power  may  be  applicable  within  restricted  task  domains  which 
have  a  fairly  well  accepted  set  of  "primitive"  procedural  operations. 

In  interactive  dialogues,  high  power  is  usually  accompanied  by  either 
high  complexity  or  restricted  generality.  In  the  MICA  survey,  the  latter 
was  a  significant  factor  in  system  rejection,  especially  by  scientific 
and  technical  users  (Stewart,  1976b). 

Although  no  clear  guidelines  emerge  with  respect  to  these  three 
dimensions,  it  is  demonstrable  that  the  prevalent  attitude  of  the  design 
community  toward  them  is  simplistic.  A  discussion  of  the  issues  in  a 
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design  guide  may,  at  the  very  least,  cause  a  more  situation-specific 
consideration  of  these  factors  by  the  designer. 

Information  Load 

The  information  load  imposed  on  the  user  is  a  major  factor  in 
the  success  of  many  interactive  systems.  Although  several  authors  have 
expressed  a  general  concern  about  this  problem  (especially  Finkelman, 
1976;  Treu,  1975),  there  is  little  evidence  that  existing  knowledge  of 
the  measurement  and  effects  of  information  load  is  being  applied  to 
computer  system  design.  User  performance  can  be  adversely  affected 
by  information  loads  which  are  either  too  high  (the  usual  problem)  or 
too  low.  The  performance  effects  of  information  load  differ  signifi¬ 
cantly  depending  on  whether  it  is  the  memory  or  processing  resources 
(or  both)  which  are  overloaded  (Norman  &  Bobrow,  1975).  This,  in  turn, 
is  strongly  affected  by  the  complexity  of  the  task  and  the  experience 
of  the  user. 

While  the  demands  of  the  underlying  task  and  the  skill  of  the 
user  are  significant  determinants  of  information  load,  there  are  many 
parameters  of  the  interactive  dialogue  which  can  be  used  by  the  designer 
to  ensure  an  acceptable  information  load.  Among  the  major  possibilities 
for  reducing  information  load  are: 

The  use  of  display  devices  with  higher  channel  capacity 
(e.g.,  graphics  instead  of  alphanumerics) 

Reformatting  displays  for  improved  correspondence  to 
the  immediate  information  requirement  of  the  user 

Use  of  commands  of  appropriate  power  for  the  immediate 
task  of  the  user 

Use  of  a  language  which  minimizes  information  processing 
by  the  user  (e.g.,  natural  language,  a  command  language 
with  appropriate  complexity) 

Moving  clerical  operations  (e.g.,  manipulations  of  data 
before  data  entry)  into  the  system 
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Structuring  command  language  to  correspond  to  the 
substructure  of  the  user's  task 

Use  of  default  values. 

The  more  basic  issue  is  the  fact  that  user  information  load 
is  often  not  a  formal  consideration  during  system  design.  It  is 
possible  to  measure  user  information  load  during  actual  or  simulated 
system  use,  by  the  use  of  competing  tasks,  for  example  (see  Norman  & 
Bobrow,  1975),  and  information  load  can  be  estimated  from  detailed 
task-analytic  data.  Even  very  gross  task  analytic  data  can  be  used  to 
obtain  a  rough  estimate.  A  design  guide  should  review  the  effects  of 
information  load  on  user  performance.  This  information  is  found  largely 
in  non-computer-related  material  (e.g.,  Meister,  1976).  The  design 
guide  should  suggest  procedures  for  determination  of  information  load, 
including  both  precise  and  rough  measures.  Finally,  the  design  factors 
which  can  be  used  to  influence  information  load  should  be  discussed. 

System  Response  Time 


System  response  time  is  an  area  in  which  human  factors  considera¬ 
tions  are  clearly  of  concern  to  the  system  designer.  There  is  an  obvious 
tradeoff  here  between  performance  (slow  system  response  reduces  user 
satisfaction  and  degrades  user  performance)  and  cost  (rapid  response  can 
increase  costs  dramatically).  In  some  quarters,  there  is  a  tendency  to 
dismiss  this  issue  as  moot  because  of  rapid  advances  in  technology  which 
allow  ever  shorter  response  delay  at  reduced  cost.  It  should  be  kept  in 
mind,  however,  that  the  shift  to  networking,  distributed  processing, 
front-end  processors,  etc.,  presents  a  whole  new  pattern  of  response 
delays  due  to  communication  time,  shared  processing,  and  so  on.  Human 
factors  issues  associated  with  system  response  can  easily  affect  the 
basic  architecture  of  these  systems,  and  such  considerations  remain  a 
significant  element  in  cost-performance  tradeoffs  even  in  conventional 
systems. 


The  human  factors  concern  here  is  with  the  effects,  on  both 
user  performance  and  user  satisfaction,  of  delays  in  system  response 
and  of  variations  in  those  delays.  These  effects  depend  in  part  upon 
the  user's  expectations  with  respect  to  system  performance,  and  the 
user's  impression  of  the  system  activity  required  to  process  a  command 
or  query  (Carbonell  et  al ,  1968).  The  user's  mental  model  of  the 
system  may  not  correspond  to  the  designer's  or  to  reality,  so  that  the 
expectancies  involved  here  may  have  implications  for  training.  Most 
importantly,  the  effects  of  response  delays  depend  upon  the  relation¬ 
ship  of  the  particular  transaction  to  the  user's  ongoing  problem-solving 
process  (Miller,  1968). 

There  are  several  related  components  of  response  delay,  as 
illustrated  and  defined  in  Figure  13.  Notice  that,  for  the  purpose  of 
this  discussion,  SRT  is  defined  as  the  interval  from  the  user  inquiry 
to  completion  of  the  resulting  display,  including  writing  time.  A 
summary  discussion  of  the  effects  of  response  delays  and  SRT  variation 
is  presented  in  Figure  14. 

The  disruptive  effect  of  excessive  SRT  on  user  performance 
appears  to  result  largely  from  interference  with  the  user's  "continuity 
of  thought"  (Miller,  1968).  This  is  a  highly  relative  concept,  involv¬ 
ing  very  short  intervals  of  time  (seconds  or  fractions  of  seconds)  for 
procedural  steps  in  the  construction  of  a  command,  and  perhaps  quite 
long  intervals  (many  minutes)  when  the  user  has  reached  "closure"  with 
respect  to  a  basic  problem-solving  step.  In  the  former  case,  the  dis¬ 
ruption  is  probably  related  to  human  short-term  memory  limits  and 
attentional  phenomena.  The  key  to  satisfactory  performance  here  may 
be  maintenance  of  a  "conversational"  interaction.  In  this  instance, 
the  key  interval  may  be  SRIT,  rather  than  SRT. 

In  some  situations,  SRT  variation  may  be  a  more  important  human 
factors  problem  than  long  delays.  Several  authors  have  speculated  that 
such  variation  can  be  demotivating  to  the  user,  resulting  in  both 
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Figure  IT.  Summary  of  System  Response  Time  Effects 


performance  decrements  and  user  dissatisfaction.  Williams’  (1975) 
data  do  not  support  this  speculation,  but  do  not  persuade  us  to  abandon 
it,  either.  More  research  is  needed  to  justify  a  firm  conclusion. 

SRT  guidelines  can  be  offered  which  appear  quite  reasonable 
from  a  psychological  viewpoint,  although  little  explicit  empirical 
verification  can  be  claimed.  Miller  (1968)  presented  SRT  guidelines 
for  17  situations  ranging  from  single  key  depressions  to  completion  of 
a  request  for  a  major  computation.  Engel  and  Granda  (1975)  have  altered 
these  slightly,  and  have  also  presented  SRT  guidelines  (attributed  to 
W.  C.  Allison  and  R.  F.  Challman)  in  terms  of  the  user's  perception  of 
the  system  activity  rquired  to  satisfy  his  inquiry.  In  the  context  of 
an  extensive  design  guide,  it  should  be  possible  to  relate  such  guide¬ 
lines  more  clearly  to  user  tasks.  This  information  might  be  supplemented 
with  data  on  user  preparation  times  (inter-response  intervals)  associated 
with  procedural  steps  in  command  construction  and  other  "conversational" 
transactions.  Several  of  the  papers  surveyed  present  data  from  which 
rough  estimates  might  be  derived  for  various  dialogue  operations  at 
this  level.  The  resulting  guidelines  can  be  related  more  clearly  to 
specific  design  decisions,  such  as  terminal  selection  and  distributed 
processing  system  architecture. 

"Artificial  lockout"  is  a  separate,  but  related  issue.  This 
involves  restricting  a  problem  solver's  access  to  the  computer  for  some 
period  of  time  after  presentation  of  results  from  the  current  request. 
There  appears  to  be  some  evidence  that  this  may  cause  the  user  (in  a 
complex  problem-solving  situation)  to  concentrate  more  on  the  problem 
to  be  solved  and  less  on  the  tactics  used  to  solve  it,  with  resulting 
improved  performance,  at  some  cost  in  user  satisfaction.  This  technique 
is  related  to  Stewart's  (1976b)  advocacy  of  dialogue  techniques  intended 
to  break  the  user's  problem-solving  "set".  Although  a  design  guide 
could  make  the  designer  aware  of  this  technique  and  its  possible  advan¬ 
tages  and  disadvantages,  it  is  not  clear  that  more  detailed  guidance  can 
be  offered. 


TYPES  OF  INTERACTIVE  DIALOGUE 


The  selection  of  a  dialogue  type  or  types  is  one  of  the  most 
important  decisions  made  in  the  course  of  designing  an  interactive 
system.  This  section  will  discuss  several  major  dialogue  types  and 
outline  what  is  known  about  their  advantages,  disadvantages,  and 
detailed  design,  and  about  the  potential  for  development  of  human 
factors  guidelines  for  each  type.  In  addition  to  the  major  dialogue 
types  discussed  here,  there  are  many  minor  variants  and  many  possibi¬ 
lities  for  combining  dialogue  types.  Although  space  does  not  permit 
a  discussion  of  all  of  these  here,  a  detailed  design  guide  must  recog¬ 
nize  their  existence  and  probably  provide  assistance  with  particularly 
common  combinations  (e.g.,  menu  selection  or  user-initiated  mnemonic 
commands  combined  with  form-filling  for  data  entry). 

A  major  controversy  involves  the  applicability  of  a  human-to- 
human  communication  model  to  person-computer  dialogue.  The  proponents 
of  such  a  model  argue  that  interactive  dialogue  which  operates  in  the 
same  way  that  humans  communicate  with  one  another  is  inherently  natural 
and  flexible,  requires  little  user  training,  and  is  very  powerful.  The 
extreme  advocates  of  this  view  conclude  that  all  computer  systems  should 
communicate  in  natural  language  (e.g.,  English,  preferably  spoken),  per¬ 
haps  supplemented  with  interactive  graphics. 

The  opponents  of  this  view  usually  agree  that  "natural  language" 
is  natural  for  the  user,  flexible,  etc.  They  point  out,  though,  that 
it  is  also  highly  ambiguous,  and  that  a  great  deal  of  general  and 
problem-specific  knowledge  is  required  to  comprehend  interpersonal 
dialogue.  They  argue  that  natural-language  dialogue  will  not  be  cost- 
effective,  except  in  rare  instances,  for  many  years,  and  that,  in  most 
applications,  it  may  even  be  an  inherently  poor  substitute  for  simple, 
umambiguous  command  languages,  menu  selection  dialogues,  etc. 

It  seems  appropriate  to  decompose  this  controversy  into  three 
different  components,  which  often  seem  to  be  confused  with  one  another: 
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(1)  natural-language  dialogue,  (2)  "natural"  dialogue,  and  (3)  seman¬ 
tic  knowledge. 


The  question  of  "natural -language"  (e.g.  ,  English)  dialogue 
will  be  discussed  in  a  later  section,  but  warrants  a  brief  treatment 
here.  We  are  now  able  to  achieve  person-computer  interaction  via 
"natural  language",  provided  the  domain  of  conversation  is  limited, 
and  the  "natural  language"  is  itself  restricted  in  syntax  and  vocabul¬ 
ary.  Furthermore,  the  language  must  usually  be  "formal1  natural  language, 
such  as  one  might  find  in  a  textbook  on  (for  example)  English  gramriar. 
Unfortunately,  we  arQ  also  beginning  to  understand  that  this  isn't  the 
way  people  actually  talk  to  one  another.  Independently  of  the  techno¬ 
logical  issues  of  automated  production  and  "comprehension"  of  natural 
language,  we  need  research  like  that  of  Chapanis  (1975,  1976)  and  his 
colleagues  to  determine  the  nature  of  the  actual  language  used  in  inter¬ 
personal  dialogue  and  the  tolerance  of  humans  for  various  constraints 
on  their  language.  And  this  research  has  a  long  way  to  go  before  it 
will  be  directly  applicable  to  system  design. 

The  second  component  of  the  controversy  is  the  issue  of  "natural" 
dialogue.  Dialogue  need  not  be  conducted  in  natural  language  to  be 
"natural",  in  the  sense  that  it  requires  little  training,  "feels  right", 
etc.  In  some  domains  (e.g.,  mathematics,  engineering  drawing),  natural 
language  is  clearly  not  the  most  appropriate  dialogue  type.  It  is  likely, 
however,  that  much  of  what  we  know  about  the  nonlingual  aspects  of  natural 
conversation  (continuity,  response  time,  etc.)  is  applicable  to  the  achieve¬ 
ment  of  natural  dialogue,  even  if  the  dialogue  is  conducted  via  a  simple 
command  language.  This  aspect  of  natural  dialogue  has  been  discussed  very 
well  by  Foley  and  Wallace  (1974). 

The  third  component  concerns  semantic  knowledge.  Satisfactory 
natural-language  processing  requires  that  the  computer  have  considerable 
semantic  information  about  the  application  domain  and  about  prior  conver¬ 
sation  in  order  to  disambiguate,  and  ultimately  to  comprehend,  the  user's 
statements.  Such  knowledge  enables  the  system  to  respond  appropriately 
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in  the  context  of  the  user's  previous  interactions  with  the  comput< 
adapting  its  interpretations  and  responses  appropriately,  more  or  1 
as  a  human  would.  It  does  not  follow,  though,  that  such  knowledge 
only  relevant  when  natural-language  techniques  are  used.  Regardles 
of  dialogue  type,  "knowledge-based  systems"  have  great  (but  expen' 
and  long-range)  potential  for  adapting  to  the  user's  needs,  style, 
specific  problem  (see  Waksman,  1974).  This  is  the  basis  of  most  ol 
proposals  for  "personalized"  systems  (e.g.,  Negroponte,  1975). 

It  seems  likely  that  syntactic  aspects  of  natural-language  a 
not  the  most  important  of  these  three  issues.  In  any  case,  it  is  t 
"natural  dialogue"  issue  which  can  be  most  readily  addressed  by  a  c 
guide  in  the  near  future.  A  design  guide  might  be  able  to  acquaint 
designer  with  some  of  the  advantages,  disadvantages,  and  complexiti 
of  natural-language  processing  and  knowledge-based  systems,  but  it 
not  attempt  to  educate  the  designer  in  the  implementation  of  such  s 
If  designers  wish  to  employ  such  techniques,  and  do  not  already  hav 
knowledge  to  do  so,  they  would  be  well  advised  to  contact  a  special 

The  remainder  of  this  section  is  divided  into  subsections  de 
with  specific  dialogue  types.  Thus,  a  discussion  of  command  langua 
properties  appears  in  the  section  dealing  with  "user-initiated  comm 
languages",  even  though  some  of  the  material  is  potentially  relevan 
the  design  of  command  languages  for  menu-selection  dialogues.  This 
zation  seems  simpler,  for  the  current  report,  than  the  more  highly 
tioned  organization  which  a  design  guide  would  need.  It  does,  howe 
require  the  reader  to  recognize  that  the  discussions  of  language  pr 
perties  are  more  broadly  applicable  than  their  placement  in  the  rep 
might  suggest. 

There  is  a  potential  for  the  construction  of  guidelines  whic 
address  dialogue  details,  but  are  relatively  independent  of  dialogu 
type.  Potential  topics  for  these  guidelines  include  such  things  as 
deletion  by  a  two-step  process,  "undelete"  capability,  error  messag 


75 


fl 


features,  provision  of  system  status  information,  feedback  and  rein¬ 
forcement  of  the  user,  and  command  stacking.  Good,  though  brief, 
guidelines  of  this  sort  already  exist  (especially  in  Engel  &  Granda, 
1975;  Nickerson  &  Pew,  1971),  but  need  to  be  made  more  comprehensive 
if  the  design  guide  is  to  cover  a  wide  range  of  dialogue  types. 

We  have  attempted  to  distinguish  dialogue  types  purely  on  the 
basis  of  dialogue  properties,  without  regard  to  the  purpose  for  which 
the  dialogue  is  intended.  Thus,  we  address  such  dialogue  types  as  form¬ 
filling,  menu  selection,  and  natural  language  (see  Figure  15).  Martin's 

j 

(1973)  excellent  book  on  interactive  dialogue  contains  a  much  more  ex¬ 
tensive  taxonomy  of  this  sort.  Others  (e.g..  Miller  &  Thomas,  1977; 
Nicholson  et  al ,  1972)  have  found  it  useful  to  classify  dialogues  pri¬ 
marily  by  purpose.  For  example.  Miller  and  Thomas  discuss  the  functions 
which  should  be  provided  in  specific  types  of  interactive  software,  such 
as  editors,  file  managers,  query  systems,  programming  aids,  and  message 
processing  facilities.  Discussions  of  this  sort  could  provide  a  very 
useful  source  of  information  for  design  guidelines  intended  for  a  re¬ 
stricted  class  of  systems,  but  are  less  clearly  useable  for  a  general- 
purpose  design  guide.  Statistical  studies  of  command  usage  (see  Figure 
may  also  be  more  useful  when  dealing  with  specific  application  areas. 
Examples  of  effective  dialogue  (e.g.,  Borko,  1966)  are  easier  to  present 
in  such  a  context,  but  Martin  (1973)  has  done  an  effective  job  with 
examples  in  his  general  treatise. 

Quest ion-and-Answer 

In  a  question-and-answer  (Q  &  A)  dialogue,  the  user  responds  to 
questions  asked  by  the  computer.  For  the  computer-naive  user,  this  is 
probably  the  simplest  dialogue  type.  Card  et  al  (1974)  and  Lucas  (1977) 
report  a  highly  successful  application  of  this  dialogue  type  in  medical 
interviewing  of  totally  naive  users.  Only  yes-no  questions  were  used, 
and  the  user  had  a  keyboard  with  three  kevs  marked  "Yes",  "No",  and  "?". 
With  no  training,  users  successfully  completed  the  questionnaires  and 
reported  satisfaction  with  the  system. 
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Q  &  A  dialogues  can  be  used  for  much  more  sophisticated  situa¬ 
tions,  and  are  often  used  for  data  entry  tasks.  Experience  suggests 
that  this  dialogue  type  is  unsatisfactory  for  systems  with  constant  or 
highly  experienced  users,  but  no  data  bearing  on  this  issue  are  known 
to  exist. 

Form-Filling 

Form-filling  is  often  used  in  situations  in  which  the  user's 
input  is  dominated  by  parameter  values,  rather  than  commands.  This 
dialogue  mode  naturally  allows  multiple  user  responses  in  a  single 
transaction  and  may  provide  more  contextual  information  concerning  the 
needed  information  than  does  a  Q  &  A  dialogue.  Effective  use  requires 
a  CRT  terminal  or  similar  device  in  which  user  input  can  be  positioned 
on  a  two-dimensional  display,  and  requires  a  terminal  tabbing  feature. 

Martin  (1973)  illustrates  several  versions  of  form-filling  dia¬ 
logues.  As  with  Q  &  A  dialogues,  little  empirical  data  exist.  Strub 
(1975)  compared  form-filling  with  unaided  user-initiated  input  of  text 
data  into  a  message-handling  system.  Unaided  input  was  40%  slower, 
but  this  is  not  a  statistically  reliable  result  because  of  the  study's 
small  sample  size. 

Pew  and  Rollins  (1975)  present  a  number  of  useful  recommendations 
with  respect  to  the  detailed  design  of  a  form-filling  dialogue.  This 
information  is  intended  for  use  by  system  designers  and  is  in  a  form 
appropriate  for  incorporation  into  a  design  guide.  It  is  not  highly 
detailed,  however,  and  should  probably  be  extended  somewhat. 

Menu  Selection 

Menu  selection  is  the  archetype  of  computer-initiated  dialogue. 
The  item(s)  to  be  selected  appears  directly  on  the  display.  The  user 
need  only  recognize  the  desired  action,  whereas  a  form-filling  or  Q  &  A 
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dialogue  may  only  prompt  the  user  for  a  response  which  must  still  be 
remembered.  Furthermore,  a  simple  menu-selection  dialogue  ordinarily 
requires  only  one  user  action  per  choice,  rather  than,  for  example, 
the  series  of  keystrokes  required  to  type  a  whole  word. 

Martin  (1973)  provides  examples  of  several  types  of  menu-selection 
dialogue.  Once  again,  there  is  little  empirical  data  pertaining  to  the 
circumstances  under  which  this  particular  dialogue  type  is  appropriate, 
although  Ridsdale  (1970)  reports  an  empirical  study  of  a  menu  selection 
dialogue  used  effectively  by  hospital  personnel  for  medical  data  entry. 
Among  the  study's  conclusions  are  that  the  "branching  questionnaire" 
technique  was  effective  for  this  data  entry  task,  and  that  menu  selection 
is  an  appropriate  form  of  dialogue  for  novice  users. 

Because  of  its  reliance  on  the  user's  passive  vocabulary  and 
recognition  memory,  menu  selection  is  a  particularly  natural  dialogue 
method  for  hierarchic  search.  It  is,  therefore,  very  popular  for  a 
variety  of  information  retrieval  dialogues.  Thompson  (1969,  1971)  provides 
an  excellent  discussion  of  such  dialogues,  with  examples  and  some  guide¬ 
lines  for  the  designer.  Uber  et  al  (1968)  discuss  this  and  other  applica¬ 
tions  of  menu  selection,  and  provide  well-considered  suggestions  for 
such  properties  of  the  dialogue  as  list  size  and  display  format. 

Menu  selection  is  also  useful  for  construction  of  commands,  when 
a  single  command  has  several  elements  such  as  a  verb  and  a  series  of 
arguments.  The  literature  contains  less  information  about  this  applica¬ 
tion,  probably  because  of  the  response-time  problems  it  creates.  When 
several  user  actions  are  required  to  support  the  construction  of  a  single 
command,  it  is  usually  necessary  that  any  system  response  which  must 
occur  within  this  process  be  very  rapid  ( e . g . ,  a  few  hundred  milliseconds). 
Thus,  menu  selection  is  ordinarily  not  an  appropriate  dialogue  type  for 
command  construction  unless  smart  terminals  or  a  dedicated  host  computer 
are  used.  When  response  i_s  rapid,  however,  this  dialogue  type  is  very 
effective  for  novices  and  experienced  users  alike. 
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Except  for  users  who  are  skilled  typists,  menu  selection  is 
most  effective  when  a  "point-in"  input  device  (e.g.,  light  pen,  touch 
panel)  is  used  (see  Earl  &  Goff,  1965).  When  an  ordinary  alphanumeric 
terminal  is  used,  it  is  usually  necessary  to  number  the  items,  or  pro¬ 
vide  some  similar  coding  scheme,  so  that  the  user  can  identify  the 
desired  option  by  code.  Pew  and  Rollins  (1975)  provide  some  reasonable 
suggestions  for  designers  using  this  approach.  Mode  mixing  is  an 
important  problem  when  a  point-in  selection  device  is  used.  If  the  user 
must  switch  back  and  forth  between  option  selection  (especially  with 
a  light  pen)  and  typing,  performance  may  be  adversely  affected  (Earl 
&  Goff,  1965).  Under  these  circumstances,  the  designer  should  consider 
either  (1)  an  extraordinary  effort  to  place  all  information  on  the  dis¬ 
play  in  selectable  form,  (2)  use  of  a  point-in  device  which  minimizes 
mode  mixing  problems  (e.g.,  a  trackball),  or  (3)  use  of  typed  codes 
to  select  menu  options. 

There  are  a  number  of  scattered  studies  which  provide  information 
useful  in  a  design-guide  discussion  o.  menu  selection.  For  example. 
Baker  and  Goldstein's  (1966)  finding  that  display  of  potential ly-but- 
not-currently-relevant  choices  interfered  with  user  performance  is 
relevant.  Ramsey  (unpublished  study)  found  that  serial  presentation 
of  successive  menus  in  the  same  area  of  the  display  is  preferable  to 
simultaneous  presentation  in  different  areas.  Engel  and  Granda  (1975) 
provide  useful  guidelines  for  formatting  of  menu  selection  displays. 

In  general,  the  information  in  the  literature  on  this  topic  has  not 
been  adequately  integrated,  but  could  be,  if  a  design  guide  is  under¬ 
taken. 

Function  Keys  With  Coirmand  Language 

Dialogues  which  use  function  keys  typically  have  some  of  the 
properties  of  computer-initiated  dialogue  and  some  user-initiated 
properties.  Although  the  user  may  have  to  construct  a  meaningful 
sequence  of  key  depressions  without  assistance  from  the  computer,  the 
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keyboard  itself  provides  a  memory  aid  which  allows  the  user  to  rely 
on  recognition,  rather  than  recall,  memory.  If  "programmable"  func¬ 
tion  keys,  with  back-lighting  or  even  variable  legends,  are  used,  the 
dialogue  can  be  a  pure  menu-selection  dialogue.  Ordinarily,  though, 
all  options  are  present  on  the  keyboard,  including  those  not  currently 
relevant. 


Although  many  systems  use  function  keys,  and  there  is  a  good 
deal  of  data  available  concerning  the  desirable  properties  of  the  key¬ 
board  itself,  there  is  very  little  information  in  the  literature  concern¬ 
ing  the  dialogue-related  aspects  of  function-key  interfaces.  The  best 
presentation  of  this  topic  is  in  Martin  (1973),  although  some  discussions 
of  specific  systems  (e.g.,  Wood,  1972)  contain  useful  information  on 
function-key  dialogues. 


User-Initiated  Command  Language 


The  most  common  dialogue  type  appears  to  be  the  user-initiated 
command  language.  This  dialogue  method  is  the  easiest  and  most  efficient 
from  the  computer's  viewpoint,  involves  the  lowest  electronic  communi¬ 
cation  overhead,  minimizes  user  waiting  time  during  command  construction, 
and  is  the  most  compatible  with  ordinary  teletypewriter  terminals.  It 
is  generally  preferred  by  system  designers  and  programmers  for  these 
reasons,  and  perhaps  because  it  is  more  like  ordinary  computer  programming 
than  the  other  dialogue  modes. 


This  is  also  the  dialogue  mode  which  inherently  provides  the 
least  assistance  to  the  user  in  acquiring  a  "mental  model"  of  the  system 
and  a  knowledge  of  the  functions  and  syntax  of  the  language.  Furthermore, 
the  syntax  tends  to  be  more  involved,  requiring  separators  (commas,  paren¬ 
theses,  etc.)  for  unambiguous  lexical  analysis  and  parsing.  Except  for 
very  simple  command  languages,  this  dialogue  mode  requires  a  trained  user, 
with  the  amount  of  training  varying  directly  with  the  complexity  of  the 
language. 
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A  user-initiated  command  language  is  probably  satisfactory  for 
computer-sophisticated  users  and  more  or  less  dedicated  users  who  can 
be  expected  to  undergo  significant  training.  In  other  contexts,  this 
dialogue  method  is  used  more  frequently  than  it  should  be,  and  has 
been  a  significant  source  of  errors  and  system  rejection  by  relatively 
unsophisticated  users. 

The  above  assertions  are  the  opinions  of  the  authors.  Although 
there  have  been  a  few  empirical  studies  dealing  with  the  desirable 
properties  of  command  languages,  we  found  no  empirical  data  evaluating 
this  basic  approach  to  interactive  dialogue,  or  comparing  it  with  others. 

We  also  found  no  well-integrated  source  of  information  to  assist 
in  the  design  of  such  a  command  language,  although  the  information  which 
was  found  might  support  such  an  integrated  treatment.  Much  of  the  basic 
research  which  deals  with  "command  languages"  is  concerned  with  the  ability 
of  humans  to  comprehend  assertions,  or  to  comprehend  and  execute  commands, 
rather  than  their  ability  to  generate  and/or  recall  commands  in  order  to 
accomplish  their  own  purposes.  Several  such  studies  are  reviewed  by 
J.  Newman  (1976).  Although  this  research  may  eventually  contribute  to 
our  understanding  of  command  languages,  these  two  tasks  are  sufficiently 
different  that  any  attempt  to  generalize  directly  from  one  to  the  other 
should  be  viewed  with  considerable  suspicion. 

Similarly,  most  of  the  research  which  has  been  done  on  programming 
language  properties  is  inapplicable  to  command  language  design  (although 
several  of  the  studies  provide  insight  into  query  language  design,  as  dis¬ 
cussed  in  the  next  section).  Most  of  the  studies  deal  with  language  pro¬ 
perties  not  usually  relevant  to  command  languages  (e.g.,  transfer  of 
control,  indentation),  and  almost  all  use  programmers  as  subjects. 

What  remains  are  a  handful  of  studies  bearing  on  specific  pro¬ 
perties  of  command  languages,  and  several  much  more  general  discussions. 
Watson  (1976)  provides  a  good  discussion  of  some  of  the  issues  in  command 
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language  design,  presents  examples  from  the  command  language  of  the 
NLS  system,  and  discusses  the  reasoning  behind  the  selection  of  its 
properties.  Although  Foley  and  Wallace  (1974)  emphasize  interactive 
graphics,  their  excellent  discussion  of  general  language  properties 
is  relevant  to  even  simple  conmand  languages.  Sayani  (1976)  presents 
a  reasonably  good  discussion  of  command  languages  for  operating  systems, 
with  an  emphasis  on  matching  the  language  to  the  user's  "mental  model" 
of  the  system.  Treu  (1975)  discusses  this  correspondence  in  more 
detail,  and  provides  a  model  of  the  sources  of  "mental  work"  involved 
in  mapping  between  the  user's  model  of  actions  to  be  performed  and 
the  set  of  commands  available  to  perform  them.  While  it  is  difficult 
to  see  how  the  model  might  be  tested  or  applied,  it  may  help  in  thinking 
about  the  problem. 

With  respect  to  overall  command  language  structure,  the  studies 
by  Carlisle  (1974)  and  Boies  (1977),  discussed  previously  in  the  section 
on  "complexity"  are  suggestive,  but  by  no  means  definitive. 

A  topic  of  greater  research  focus  is  the  use  of  positional  or 
keyword  syntax  for  parameter  values  in  commands.  Commands  with  keyword 
parameters  typically  have  a  form  such  as 

COMMAND ( KEYWORD=VAltJE ,  KEYWORD=VALUE ) 

while  positional  parameters  are  identified  by  the  position  of  the  values 
in  a  list,  as 

COMMAND (VALUE, VALUE). 

Although  the  positional  syntax  is  more  concise,  it  involves  higher  memory 
loads  for  the  user,  because  of  the  implicit  information  which  the  user 
must  provide.  Heafner  (1975)  presents  indirect  evidence  that  programmer 
users  of  a  command  language  prefer  a  keyword  syntax.  Weinberg  (1971) 
mentions  (but  does  not  describe)  a  study  in  which  programmers,  working 
with  relatively  unfamiliar  material,  performed  faster  with  a  positional 
syntax,  but  made  2-4  times  as  many  errors  as  with  a  keyword  syntax. 
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Abbreviation  of  commands  and  keywords  is  also  a  relevant  topic, 
on  which  little  specific  research  has  been  done.  Watson  (1976)  describes 
a  system  in  which  several  command  abbreviation  schemes  are  available  to 
the  user.  Of  these,  the  most  frequently  used  consists  of  single-character 
abbreviations  for  the  more  commonly  used  commands  (less  common  commands 
with  conflicting  first  characters  require  longer  abbreviations  for  disam¬ 
biguation).  "First-k-character"  algorithms  are  frequently  used.  Such 
abbreviated  input  is  important  for  the  experienced,  frequent  user,  and 
should  be  consistent  with  unabbreviated  command  usage  in  order  to  provide 
a  smooth  transition  for  the  user- in- training. 

The  use  of  default  values  is  also  helpful  in  promoting  ratural  and 
concise  dialogue.  Worthwhile  suggestions  are  found  in  Gilb  and  Weinberg 
(1977)  and  Martin  (1973),  but  no  directly  applicable  research  was  noted. 

Existing  "guidelines"  papers  (e.g.,  Engel  &  Granda,  1975;  Nickerson 
&  Pew,  1971)  provide  little  specific  help  with  command  language  design. 

They  do  provide  general  suggestions  concerning  such  matters  as  error 
messaging  and  overall  style.  There  do  not  appear  to  be  sufficient  re¬ 
search  data  to  support  definitive  guidelines  in  this  area,  but  this  is  an 
area  in  which  general  knowledge  of  human  information  processing  and  basic 
human  factors  principles  could  provide  considerable  guidance.  Without 
being  overly  speculative,  it  should  be  possible  to  develop  reasonable  human 
factors  guidelines  dealing  with  command  language  design.  It  is  somewhat 
surprising  that  this  topic  has  received  so  little  attention  on  the  human 
factors  side  of  the  literature.  Guidelines  in  this  area  should  address 
such  topics  as  command  language  structure  and  complexity;  statement  syn¬ 
tax,  including  keyword  and  positional  forms;  separators  and  terminators; 
abbreviations  and  processing  thereof;  default  values;  command  choice, 
including  minimization  of  both  syntactic  and  semantic  confusability; 
mechanisms  for  switching  to  computer-initiated  form;  error  messaging 
and  recovery;  command  stacking;  and  overall  dialogue  style. 
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In  a  query  language,  the  user  interacts  with  a  system  whicn 
has  access  to  a  data  base.  The  query  language  is  used  to  express  the 
user's  specific  request(s)  for  factual  information.  Ordinarily,  query 
languages  do  not  alter  the  data  base,  but  merely  allow  the  user  to  ask 
questions  about  the  data. 

Research  in  this  area  has  been  mainly  of  two  types.  Several 
very  basic  research  efforts  have  allowed  computer-naive  personnel  to 
express  queries  in  natural  language  or  in  more  constrained  languages, 
and  have  attempted  to  characterize  their  behavior  in  order  to  provide 
insight  into  query  language  use.  Other  studies  have  investigated  the 
useability  of  specific  available  query  languages,  usually  not  in  a 
comparative  fashion.  In  most  studies,  the  experimenter  has  provided 
queries  which  were  to  be  translated  into  a  specified  language.  The 
query  language  user,  however,  starts  with  a  need  for  information,  and 
must  formulate  a  question,  plan  an  approach,  and  encode  the  query. 

Only  Gould  and  Ascher  (1975)  have  really  addressed  this  overall  process. 

The  studies  of  specific  languages  have  generally  concluded  that 
existing  query  languages  can  be  learned  and  used  by  both  novice  and 
computer-sophisticated  users.  The  languages  investigated  include  a 
variant  of  IBM's  Interactive  Query  Facility  (Gould  &  Ascher,  1975), 
which  had  relatively  high  error  rates,  Zloof's  "Query  by  Example" 

(Thomas  &  Gould,  1975)  and  SQUARE  and  SEQUEL  (Reisner,  1977;  Reisner 
et  al  1975).  Although  these  languages  differ  considerably  in  basic 
philosophy  and  in  details  of  the  dialogue,  only  one  study  (Reisner) 
has  attempted  any  sort  of  comparison,  and  that  involved  two  relatively 
similar  languages.  Essentially,  comparative  data  on  basic  approaches 
to  query  languages  appear  to  be  nonexistant. 

Although  the  languages  listed  above  were  basically  useable,  many 
errors  were  made  In  their  use,  and  many  errors  emerged  from  the  more 


basic  studies  which  did  not  deal  with  currently  automated  languages. 
These  errors  provide  considerable  insight  into  the  kinds  of  logical 
constructions  and  query  language  features  which  present  difficulties 
for  users  (see  Figure  16).  All  of  the  results  appear  to  be  basically 
consistent  with  older  research  on  concept  identification  and  related 
topics. 


In  some  cases,  fairly  specific  conclusions  concerning  language 
features  seem  warranted.  For  example,  a  user  probably  should  not  be 
required  to  convert  "over  50"  to  "51  or  more",  and  elimination  of  the 
word  "or"  from  query  languages  might  avoid  errors  (Gould  &  Ascher 
recommend  alternatives  involving  special  punctuation  and  an  "ALSO" 
construct).  At  a  somewhat  higher  level,  Reisner  (1977)  concludes 
from  her  experiment,  on  both  novice  and  programmer  users,  that  a 
"layered"  or  partitioned  query  language  design  is  appropriate  and  makes 
suggestions  for  its  properties.  For  the  most  part,  however,  insufficient 
data  exist  to  support  broad,  detailed  guidelines  concerning  query 
language  design. 

In  addition  to  a  number  of  specific,  but  scattered  recommendations 
such  as  those  above,  some  higher-level  approaches  have  been  suggested 
which  may  make  query  languages  easier  to  use  and  less  error-prone.  The 
most  broadly  applicable  of  these  is  a  restatement  of  the  user's  query, 
by  the  system,  before  execution  (Gould  &  Ascher,  1975).  Such  a  restate¬ 
ment  should  paraphrase  the  user's  input,  rather  than  repeating  it 
as  entered,  and  should  be  suppressible  by  the  user  (it  should  probably 
be  included  with  printed  output  even  if  suppressed.  Clearly,  many 
of  the  categories  of  error  described  in  Figure  16  might  be  detected 
by  an  attentive  user  if  such  feedback  were  provided. 


Several  investigators  have  recommended  more  active  aids  to 
query  formulation.  Typically,  these  aids  have  been  proposed  in  a 
restricted  information  retrieval  domain  (usually  bibliographic  search) 
and  have  involved  a  gradual  interactive  negotiation  between  the  system 
and  the  user  (e.g.,  Caruso,  1970;  Green  1967;  Summit,  1971).  In  some 
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Figure  16.  Some  Known  Problem  Areas  in  the  Use  of  Query  Languages. 


cases,  a  serial  partitioning  of  the  task  is  advocated.  For  example, 
Wilde  (1969)  deals  with  the  difficulty  of  set  relations  by  partition¬ 
ing  the  query  formulation  into  a  search  term  identification  phase  and 
a  second  phase  in  which  the  logical  relations  among  the  terms  are 
specified.  Clarke  (1970)  suggests  aids  which  even  assist  the  user 
in  determining  appropriate  relations. 

An  issue  which  has  received  particular  attention  is  the  ability 
of  users  to  specify  queries  in  procedural  form.  Computer  programmers 
typically  express  queries  in  the  form  of  explicit  procedures  which,  if 
executed,  will  presumably  lead  to  the  desired  answer.  Miller  and  Becker 
(1974)  found  that  unconstrained  naive  users  tend  to  use  nonprocedural, 
or  descriptive,  specifications  which  are  often  ambiguous  or  incomplete, 
make  contextual  references,  and  refer  to  data  aggregates  rather  than 
individual  data  elements.  However,  a  later  study  by  Gould  et  al  (1976) 
seems  to  suggest  that  this  need  not  be  a  problem.  They  found  that,  with 
proper  task  instructions,  naive  users  could  employ  a  restricted-syntax 
procedural  query  language.  In  fact,  the  users  voluntarily  continued 
to  use  this  language,  in  preference  to  r . tu ra  1  language,  once  they  had 
been  exposed  to  i„.  Highly  procedural  query  languages  are  only  one 
step  removed  from  programming  languages,  and  there  is  a  great  deal  of 
literature  on  their  design  (some  of  it  based  on  empirical  study).  The 
survey  did  not  include  literature  on  this  topic,  however. 

An  excellent  study  by  Durding  et  al  (1974)  identified  some  capa¬ 
bilities  and  limitations  of  users  in  dealing  with  data  bases  having 
various  structures  (e.g.,  network,  list,  table,  hierarchy).  The  basic 
finding  is  that  users  can  work  with  all  of  these  structures,  but  onlv 
when  they  match  the  user's  perception  of  the  underlying  structure  of 
the  problem.  It  seems  to  follow  that  data  base  systems  and  query 
languages  which  arbitrarily  restrict  the  data  organization  to  a  ; 
cular  type  or  types  may  result  in  user  errors,  such  as  those  ot ‘ .  • .  *< 
in  the  study,  when  those  systems  are  applied  to  problems  whos<  tu> 
data  structure  is  different. 


Natural -Language  Dialogue 


Natural -language  dialogue  is  still  a  highly  experimental  area. 
Although  many  command  languages  exist  which  happen  to  use  verbs  and 
nouns  taken  from  a  natural  language,  these  are  not  at  all  what  is  meant 
by  "natural-language  dialogue".  The  ultimate  goal  here  is  to  be  able 
to  converse  with  the  computer  in  the  same  way  as  with  another  person. 

This  is  a  much  more  complicated  issue,  as  was  pointed  out  in  the  earlier 
discussion  of  "natural"  vs  "natural-language"  dialogue. 

Natural-language  dialogue  is  intended  to  allow  for  a  great  deal 
of  flexibility  on  the  part  of  the  user.  The  inherent  flexibility  of 
interpersonal  dialogue,  however,  makes  this  a  difficult  goal  to  achieve. 
Human  communication  is  characterized  by  ungrammatical  utterances  and  syn¬ 
tactic  and  semantic  ambiguities.  Several  authors  (e.g..  Hill,  1972;  Mont 
gomery,  1972)  have  cited  these  difficulties  and  have  argued  that  these 
difficulties  prohibit  the  development  of  natural-language  dialogues. 

It  is  difficult  to  describe  the  current  state  of  the  art  in  this 
area  because  of  its  rapidly  changing  characteristics.  There  is  a  great 
deal  of  research  effort  being  expended  here,  and  frequent  reports  of 
"success".  Yet  the  best  results  to  date  are  systems  which  are  very  sophi 
ticated  and  expensive  to  use,  require  extensive  "knowledge"  of  the  task 
domain,  and  have  somewhat  limited  vocabularies.  Since  operational  use 
of  unconstrained  natural -language  dialogue  is  not  yet  feasible,  the 
dialogue  must  be  constrained  in  some  way.  The  current  literature  on  man- 
computer  interaction  primarily  considers  two  such  constraints  and  their 
effects  on  user  performance.  Either  the  vocabulary  or  the  syntactic 
structures  of  the  language  are  limited. 

Syntactic  Constraints  and  Disambiguation 

There  is  some  evidence  (e.g.,  Carbonell,  1970)  that  limiting 
the  syntactic  structures  that  can  be  employed  may  not  seriously  affect 
the  "naturalness"  of  the  dialogue,  depending  on  the  number  and  nature 
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of  the  syntactic  structures  allowed.  However,  it  is  not  clear  what 
happens  to  the  useability  of  an  otherwise  natural  language  as  syn¬ 
tactic  constraints  are  applied.  Significant  dialogue-processing  dif¬ 
ficulties  can  be  eliminated  by  not  allowing  some  of  the  more  complex 
and  subtle  constructs  of  natural  language  (e.g.,  center  embedding, 
contextual  references). 

It  may  be  that  elimination  of  such  constructs  would  not  have 
significant  adverse  effects  on  performance.  On  the  other  hand,  a 
dialogue  which  seems  natural  may  encourage  the  use  of  normal  conver¬ 
sational  language,  including  those  constructs  which  have  been  eliminated. 
If  it  is  unclear  why  the  user's  apparently  clear  inputs  are  rejected  or 
even  misinterpreted,  the  adverse  effects  on  performance  and  system  accept¬ 
ance  could  be  quite  significant.  Research  is  needed  to  investigate  the 
effects  on  user  performance  of  those  types  of  syntactic  constraints  most 
likely  in  computer  dialogue. 

Another  aspect  of  syntax  which  is  particularly  important  concerns 
spelling  and  grammatical  errors  and  ambiguities.  Although  such  errors 
generally  present  little  difficulty  in  processing  interpersonal  dialogue, 
they  pose  serious  problems  for  the  designer  of  an  interactive  system. 

The  approach  that  is  frequently  taken  is  to  assume  that  all  user  inputs 
are  free  of  spelling  and  grammatical  errors  or  ambiguities.  In  this 
case,  the  success  of  the  user-computer  dialogue  depends,  in  large  part, 
on  the  user's  typing  (or  other  input)  and  language  skills. 

If  user  inputs  do  contain  ambiguities,  it  is  desirable  for  the 
computer  system,  rather  than  the  user,  to  detect  the  ambiguity  and 
initiate  some  sort  of  disambiguation  procedure.  In  the  systems  described 
by  Codd  (1974)  and  Plath  (1972),  for  example,  ambiguities  are  resolved  by 
presenting  the  user  with  a  list  of  possible  interpretations  and  asking 
the  user  to  select  the  appropriate  one.  These  systems  are  designed  for 
limited  domains,  however,  and  determining  alternative  interpretations 
is  somewhat  simplified  by  this  fact. 
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Ideally,  the  system  should  be  able  to  perform  part,  or  all,  of  the 
disambiguation  procedure  with  no  input  from  the  user.  Although  such  a  sys¬ 
tem  has  been  Implemented  in  the  context  of  an  interactive  programming  language 
(Teitelman,  1972)  where  the  syntax  and  semantics  are  rigidly  constrained, 
it  is  much  more  difficult  to  implement  in  a  natural-language  dialogue. 

Vocabulary  Constraints 

The  second  major  area  in  which  natural-language  dialogue  systems 
commonly  constrain  the  dialogue  is  in  the  semantics,  or  vocabulary,  in¬ 
volved.  In  part,  the  vocabulary  that  the  user  would  be  expected  to  use 
is  a  function  of  the  application  area.  It  is  only  in  extremely  simple 
domains,  however,  that  the  meaning  of  any  word,  or  group  of  words,  can 
be  clearly  and  unambiguously  defined.  In  most  areas  of  interest  to 
interactive  system  designers,  semantic  complexity,  in  totally  unconstrained 
dialogues,  cannot  be  avoided. 

As  with  syntactic  ambiguities,  a  common  method  for  circumventing 
these  problems  is  to  restrict  the  vocabulary  that  the  user  can  employ. 

Some  systems  do  not  constrain,  in  an  obvious  manner,  the  semantic  struc¬ 
tures  that  the  user  can  employ;  rather,  the  system  is  able  to  process 
only  a  subset  of  the  possible  structures  and  "ignores"  other  structures. 

The  vocabulary  that  could  be  used  in  either  interpersonal  or  user- 
computer  dialogue  can  be  viewed  as  consisting  of  two  subsets.  The  first 
subset  contains  common,  "general-purpose"  words  and  the  second  consists 
of  words  that  are  generally  relevant  to  a  particular  application  area. 

For  example,  bibliographic  search,  command  and  control,  etc.,  have  rather 
specialized  vocabularies.  In  a  general  sense,  the  application-specific 
vocabulary  is  used  to  convey  essential  information  while  the  general- 
purpose  vocabulary  conveys  little  actual  information  but  adds  "natural¬ 
ness"  to  the  dialogue. 

In  problem-solving  contexts,  it  has  been  demonstrated  that  vocabu¬ 
lary  limitations  on  interpersonal  dialogue  do  not  seriously  affect  per¬ 
formance,  provided  that  these  limitations  do  not  affect  the  problem- 
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specific  words  (Kelly,  1975;  Ford  1977).  A  similar  result  would  be 
expected  in  natural-language  dialogue,  provided  that  the  vocabulary 
was  selected  to  include  the  necessary  words  for  an  application  area. 


The  systems  described  by  Codd  (1974)  and  Plath  (1972)  employ 
semantic  restrictions,  in  addition  to  syntactic  restrictions,  and 
ambiguities  are  handled  in  the  manner  described  earlier  in  the  context 
of  syntactic  ambiguities.  The  system  developed  by  Carbonell  (1970) 
is  a  good  example  of  a  system  that  limits  the  vocabulary  to  a  subset 
that  is  selected  on  the  basis  of  the  application  area. 

Comments 

Progress  toward  eliminating  these  limitations  and  allowing  for 
completely  unconstrained  dialogues  is  being  made  in  the  area  of  artifi¬ 
cial  intelligence.  Although  this  approach  is,  ultimately,  the  most 
desirable,  it  is  unlikely  that  all  of  the  associated  problems  will  be 
overcome  in  the  near  future.  Examples  of  such  systems  can  be  found 
in  J.  S.  Brown  et  al  (1974),  Carbonell  and  Collins  (1970),  and  Grignetti 
et  al  (1974).  Currently,  such  systems  are  limited  to  small,  well-defined 
application  areas  and  they  can  not  be  applied  directly  to  other  domains. 

Eventually,  the  knowledge  and  techniques  being  generated  by  the 
development  of  such  systems  may  be  available  to  designers  of  interactive 
systems.  It  is  unlikely,  however,  that  any  of  these  benefits  will  be 
available  in  the  near  future.  It  is  also  important  to  recognize  that  the 
designer  of  a  system  will  rarely  implement  a  natural -language  dialogue 
capability.  More  commonly,  a  natural -language  dialogue  package,  which 
has  been  developed  elsewhere,  will  be  selected  for  incorporation  into 
the  system  being  designed.  It  is  possible,  then,  that  future  designers 
of  application  systems  will  be  more  concerned  with  selection  of  such 
dialogue  capabilities  than  with  their  t  ;ign  or  implementation. 


Interactive  Graphics 


> 


Interactive  graphics  is  not  actually  a  separate  dialogue  type  — 
it  can,  in  fact,  incorporate  any  of  the  dialogue  types  already  mentioned. 
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Nonetheless,  it  offers  enough  unique  advantages  and  problems  to  warrant 
a  separate  discussion.  Graphical  displays,  of  course,  allow  certain 
classes  of  information  to  be  communicated  to  the  user  at  very  high  in¬ 
formation  rates,  because  of  the  sophisticated  visual  information  pro¬ 
cessing  capability  of  humans.  If  they  are  well  designed  and  sufficiently 
rapid,  graphical  dialogues  can  be  extremely  "natural"  and  (from  the  user’s 
viewpoint)  efficient.  There  is  evidence  that  graphical  dialogues  are 
intrinsically  motivating, at  least  for  novice  users  (Solomon  &  Papert, 

1976,  discuss  this  in  connection  with  their  use  of  graphics  in  computer- 
assisted  instruction  for  children). 

Some  application  domains  are  inherently  graphical,  and  are  obvious 
candidates  for  graphical  dialogue.  Weinzapfel  and  Negroponte  (undated) 
suggest  that  interactive  graphics  may  allow  users  unskilled  in  spatial 
visualization  to  effectively  solve  problems  which  rely  heavily  on  such 
skills.  Graphics  can  be  especially  helpful  when  the  problem  to  be 
solved  has  multiple,  interacting  dimensions.  Krolak  et  al  (1971)  il¬ 
lustrate  a  case  in  which  the  use  of  graphics  allows  a  naturally  graphi¬ 
cal  (traveling  salesman)  problem  to  be  solved  using  a  particular  problem 
decomposition  which  would  be  unworkable  without  interactive  graphics. 
Furthermore,  interactive  graphics  can  be  very  effective  in  problem  areas 
which  are  not  inherently  graphical.  Balzer  and  Shirey  (1968)  present 
an  interesting  example  involving  the  "firing  squad"  problem.  Smith  and 
Crabtree  (1975)  present  a  particularly  creative  application  of  graphical 
displays.  In  their  experiment,  the  user  solved  a  graphical  problem  in¬ 
volving  the  fitting  of  rectangles  within  rectangles.  Their  performance 
was  superior  to  performance  on  the  real,  underlying  —  and  nongraphical  -- 
problem  (job-shop  scheduling)  to  which  the  graphical  representation  was 
mathematically  isomorphic. 

Interactive  graphics  has  a  voluminous  literature  of  its  own, 
most  of  which  deals  with  particular  applications  or  with  graphical  hard¬ 
ware.  In  our  survey,  we  attempted  to  include  those  documents  which  dealt 
fairly  specifically  with  human  factors  Issues  and  research.  Many  studies 
were  found  which  deal  with  specific  issues  of  graphical  input  or  output 
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devices  and  techniques,  although  research  on  relatively  recent  techno¬ 
logical  developments  in  graphics  was  noticeably  lacking.  These  studies 
are  discussed  in  later  sections.  Very  little  research  was  found  which 
dealt  with  the  higher-level  issues  of  the  overall  dialogue,  however. 

Graphics  is  an  area  in  which  research  lags  application  considerably. 
In  this  case,  the  response  of  the  application  community  has  been  to  pro¬ 
ceed  cheerfully  ahead,  making  its  decisions  on  whatever  pragmatic  bases 
are  available.  This  may  be  preferable  to  the  alternative  --  observed  in 
some  areas  of  software  development,  for  example  —  in  which  a  major  re¬ 
search  gap  has  been  filled  largely  with  unskilled  and  even  haphazard 
experimental  studies. 

Among  other  things,  the  dearth  of  quantitative  research  makes 
it  difficult  to  assist  the  designer  with  cost-performance  tradeoffs. 

Almost  any  interactive  computer  application  could  benefit  from  a  skillful 
use  of  graphics,  but  the  performance  advantages  would,  in  many  cases,  fail 
to  offset  the  significantly  higher  costs  of  interactive  graphics.  Even 
inherently  graphical  problems  may  not  justify  the  use  of  interactive 
graphics.  In  the  absence  of  performance  data,  tradeoff  decisions  are 
very  difficult,  and  are  often  made  for  reasons  that  have  little  to  do 
with  performance. 

« 

Modern  command-and-control  facilities  offer  some  of  the  greatest 
dialogue  design  challenges.  Each  of  these  systems  typically  has  mul¬ 
tiple  users,  widely  differing  in  information  needs  and  computer- related 
experience.  Such  facilities  often  involve  integration  of  a  wide  range 
of  display  and  input  devices,  including  interactive  graphics,  usually 
with  restrictions  on  device  choice  and  use  because  of  security  and 
survivability  requirements.  There  is  little  information  on  which  to  draw 
in  order  to  design  an  effectively  integrated  system  of  this  complexity, 
although  a  number  of  research  programs  are  now  attempting  to  address 
this  problem. 


At  the  level  of  ordinary  interactive  graphics  at  a  single 
workstation,  there  are  several  publications  which  contain  useful 
guidelines  or  examples.  For  suggestions  on  the  overall  philosophy  — 
and  some  details  of  good  graphical  dialogue,  Foley  and  Wallace  (1974) 
is  an  excellent  source  which  emphasizes  "natural"  communication. 

Martin  (1973)  provides  less  guidance,  but  contains  examples  of  a 
wide  variety  of  graphical  displays  and  dialogues  which  may  be  sugges¬ 
tive  to  the  designer.  Prince  (1971)  presents  a  good  overview  of 
graphics  as  applied  to  computer-aided  design. 

More  detailed  guidelines  in  such  areas  as  display  formatting, 
coding,  and  input  device  selection  and  use  are  discussed  in  later 
sections. 

EM8EDDED  TRAINING 

Embedded  training  is  the  use  of  an  interactive  system  to  train 
its  own  user.  Although  it  is  obviously  related  to  the  broader  field 
of  computer-assisted  instruction  (CAI),  embedded  training  occurs  only 
when  the  subject  matter  being  taught  by  CAI  is  the  use  of  the  same 
interactive  system  for  some  other  primary  purpose.  Another  term  com¬ 
monly  used  for  such  systems  is  "self-tutorial".  Embedded  training  has 
received  attention  since  the  early  days  of  interactive  systems,  and 
has  found  its  way  into  many  systems,  in  greater  or  lesser  degree,  but 
has  received  limited  attention  in  the  form  of  human  factors  research 
or  guidelines. 

In  order  to  render  its  boundaries  clearer,  it  may  help  to  dis¬ 
tinguish  embedded  training  from  some  other  facilities.  Computer- initiated 
dialogue  and  ordinary  error-messaging  and  recovery  features  may  help  the 
user  learn  the  system,  but  they  do  not  constitute  embedded  training,  nor 
does  a  simple  "help"  facility  which  provides  information  about  system 
commands  on  user  request.  Embedded  training  involves  an  explicit  tutorial 
package  which  teaches  the  user  how  to  perform  the  mechanical  operation 
of  the  system  and  terminal,  or  how  to  utilize  the  system  in  the  user's 
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task,  or  both.  It  is  also  appropriate  to  distinguish  embedded  train¬ 
ing  from  job  performance  aids,  which  may  be  tutorial  in  nature,  but 
are  intended  for  use  each  time  the  user's  task  is  performed,  even  after 
the  user  is  experienced. 

Two  investigators  report  explicit  attempts  to  assess  the  effect¬ 
iveness  of  embedded  training.  Goodwin  (1974)  describes  (with  illustra¬ 
tions)  a  training  facility  which  acquaints  new  users  with  a  particular 
terminal  workstation.  The  emphasis  in  this  package  is  on  the  mechanical 
use  of  the  terminal,  including  display,  keyboard,  and  lightgun.  The 
package  is  simple,  but  well  done.  Morrill  (1967)  presents  a  less  detailed 
description  of  an  embedded  training  package  in  a  management  information 
system.  Morrill's  tutorial  facility  teaches  not  only  use  of  the  terminal, 
but  also  the  dialogue  of  the  system  and,  to  a  limited  extent, the  use  of 
the  system  in  data  acquisition. 

Each  of  these  studies  reports  performance  data  on  the  use  of  the 
system  by  trainees,  but  neither  provides  any  sort  of  comparison  with 
conventional  training  or  any  other  training  approach.  In  both  cases, 
the  data  suggest  that  the  training  was  effective,  but  are  not  really 
definitive  without  some  basis  for  comparison.  Both  reports  present  good 
ideas  and  examples  for  the  construction  of  such  packages. 

There  have  been  many  anecdotal  reports  that  embedded  training 
is  affective,  and  it  would  be  quite  surprising  if  well-designed  tutorial 
packages  failed  to  convey  a  training  advantage  in  most  situations.  Learn¬ 
ing  on  the  system  itself  provides  a  better-than-usual  opportunity  for 
high  transfer-of-training,  and  computer  systems  are  ideal  for  providing 
rapid  feedback,  performance  tests  integrated  with  the  training,  etc. 

In  particular,  it  should  be  noted  that  embedded  training  provides  a 
better  training  situation  than  does  CAI  in  general,  because  of  the 
similarity  of  the  training  situation  to  the  ultimate  job  to  be  performed 
by  the  user. 
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The  surveyed  literature  contains  several  examples  of  much 
more  sophisticated  tutorial  packages.  In  general,  these  are  inter¬ 
active  problem-solving  or  decision-aiding  systems  with  significant 
tutorial  aspects.  The  tutorial  aids  are  primarily  associated  with  im¬ 
proving  the  user's  decision-making  or  problem-solving  behavior,  and 
so  constitute  job  performance  aids  or  CAI  systems,  rather  than  embedded 
training  £er  se.  They  do  contain  useful  ideas  for  embedded  training, 
however.  Among  these  papers  are  Allan  (1968),  Baldwin  (1977),  J.  S. 
Brown  et  al  (1974),  Carbonell  (1970),  Collins  et  al  (1974),  and  Lums- 
daine  et  al  (1970).  Caruso  (1970)  describes  a  system  which  nicely 
integrates  tutelage  on  problem  solving  in  bibliographic  search  tasks 
with  tutelage  on  the  mechanics  of  the  system  itself. 

Undoubtedly,  the  most  sophisticated  example  of  embedded  training 
found  in  the  survey  is  NLS-Scholar  (Grignetti  et  al,  1974).  Scholar 
is  a  fairly  sophisticated,  natural-language  CAI  system  which  can,  in 
principle,  be  used  to  teach  in  a  wide  variety  of  task  domains.  One 
of  the  areas  in  which  it  has  been  applied  is  in  training  the  user  of 
NLS  (oN  Line  System),  a  powerful  system  used  for  document  preparation, 
communication,  and  a  variety  of  interactive  problem-solving  tasks. 

This  may  be  an  important  approach  in  the  future.  The  factors  which  make 
natural  language  difficult  to  apply  in  many  domains  (especially  the  com¬ 
plexity  of  the  required  domain  knowledge)  are  more  manageable  in  the 
context  of  teaching  an  interactive  system  whose  properties  are  already 
formally  defined. 

At  present,  good  (and  bad)  examples  are  about  all  one  can  point 
to  in  the  way  of  a  literature  on  embedded  training.  However,  there  is  a 
large  literature  dealing  with  CAI,  course-authoring  languages,  etc., 
which  may  be  relevant.  Any  guidelines  built  for  the  designer  in  the 
near  future  will  probably  have  to  rely  heavily  on  general  principles 
for  the  construction  of  training  materials,  with  examples  of  the  appli¬ 
cation  of  such  principles  to  embedded  training. 
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DIALOGUE  SPECIFICATION  AND  CONTROL  TECHNIQUES 

The  preceding  discussion  of  interactive  dialogue  has  concentrated 
on  the  selection  of  an  appropriate  dialogue  type,  the  properties  which 
such  dialogues  should  have,  and  on  the  possibilities  for  developing  guide¬ 
lines,  on  these  topics,  for  use  by  interactive  system  designers.  The 
techniques  used  to  specify  and  to  control  interactive  dialogues  are  normally 
considered  to  be  an  implementation  matter,  and  are  not  usually  discussed 
in  connection  with  human  factors  issues.  However,  we  feel  that  these  topics 
warrant  discussion  for  three  reasons.  First,  the  choice  of  a  dialogue 
control  technique  often  determines  properties  of  the  dialogue.  Even  with 
a  good  high-level  dialogue  design,  undesirable  features  often  creep  into 
the  system  as  a  result  of  this  "implementation"  decision.  The  second 
reason  is  somewhat  corollary  to  the  first.  If  good  dialogue  control  tech¬ 
niques  can  improve  the  behavior  of  a  system  --  and  perhaps  make  it  easier 
to  implement,  as  well  --  then  by  making  the  designer  aware  of  such  tech¬ 
niques  we  may  help  the  designer  to  produce  better  systems.  The  third 
reason  may  be  somewhat  ambitious,  given  the  current  state  of  the  art,  but 
it  is  potentially  the  most  important  of  the  three.  As  dialogue  specifi¬ 
cation  and  control  techniques  become  more  comprehensive  and  powerful,  it 
may  be  possible  to  "build  in"  good  human  factors.  At  least  some  aspects 
of  the  dialogue  might  be  made  to  conform  to  "good"  human  factors  princi¬ 
ples  without  explicit  attention  by  the  designer. 

Pew  and  his  colleagues  (Pew  &  Rollins,  1975;  Pew  et  al ,  1976) 
have  developed  a  set  of  dialogue  specification  techniques  whose  explicit 
goal  is  this  kind  of  "built-in"  human  engineering.  These  are  not  dialogue 
control  techniques  --  by  which  the  computer  executes  the  interactive  dia¬ 
logue  --  but  are  methods  used  by  the  designer  to  communicate  the  desired 
dialogue  to  the  implementor.  The  proposed  approach  represents  a  very 
good  first  step,  but  covers  only  relatively  simple  menu  selection  and 
form-filling  dialogues.  It  is  not  clear  that  this  approach  can  easily 
be  extended  to  more  complex  dialogue,  but  it  can  probably  be  applied  to 
any  dialogue  types  which  are  basically  "frame"  oriented  ( i . e . ,  built 
around  a  series  of  relatively  static  displays  or  subdisplays). 
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It  is  possible  that  these  frame-oriented  specification  methods 
could  be  extended  to  dialogue  control  in  such  a  way  that  the  bulk  of 
the  detailed  dispiv  formatting  and  input  handling  would  be  accom¬ 
plished  automatically  and  in  accord  with  a  reasonable  set  of  a  priori 
human  factors  criteria.  This  has  been  done  in  connection  with  individ¬ 
ual  projects,  but  not,  to  our  knowledge,  in  any  very  powerful  or  generic 
way.  Typically,  the  capabilities  of  such  dialogue  control  packages 
have  been  strongly  influenced  by  very  specific  features  of  the  dialogue 
which  they  were  designed  to  support. 

For  question-and-answer  dialogues,  fairly  generic  dialogue  con¬ 
trol  software  does  exist.  Carlisle  (1972)  discusses  a  language  ("HELP") 
based  on  constructs  similar  to  IF-statements,  and  Martin  (1973)  reviews 
several  simple  course-authoring  languages,  showing  their  applicability 
to  interactive  dialogue  control.  In  the  case  of  these  simple  Q  &  A 
dialogues,  display  formatting  and  input  processing  are  trivial,  and  a 
specification  of  the  verbal  inputs  and  outputs  is  virtually  a  comnlete 
specification  of  the  dialogue. 

Several  generic  mechanisms  exist  for  controlling  the  verbal 
component  of  more  complex  dialogue  types.  Feyock  (1977)  discusses  the 
use  of  state-transition  diagrams  to  describe  command  languages,  and 
points  out  that  they  allow  a  fairly  automatic  mechanism  for  controlling 
both  user-and  computer-initiated  dialogue,  help  facilities,  and  some 
classes  of  error  messaging  and  recovery.  If  the  software  which  controls 
such  facilities  already  exists,  the  designer  need  only  specify,  via  state- 
transition  diagrams,  the  legal  commands.  The  remainder  of  the  dialogue 
is  determined  by  the  properties  of  the  pre-existing  software.  It  is  this 
kind  of  approach  which  offers  some  potential  for  "built-in"  dialogue 
characteristics  to  which  the  designer  need  not  attend.  If,  for  example, 
a  state-transition  diagram  is  used  to  control  a  menu-selection  dialogue 
with  a  lightpen,  such  matters  as  frame  formatting,  menu  placement  and 
spacing,  and  input  processing  for  the  lightpen  are  not  part  of  the  state 
transition  diagram,  but  of  the  controlling  software. 
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State-transition  diagrams  are  satisfactory  only  for  the  control 
of  syntactically  simple  command  languages.  Other  mechanisms  sometimes 
used  for  such  languages  include  IF  statements  and  keyword  tables.  Kasik 
(1976)  reviews  these  approaches,  and  offers  a  variation  on  the  state- 
transition  diagram,  the  "triply-linked  tree".  For  syntactically  simple, 
hierarchic  command  languages,  this  approach  allows  automatic  processing 
of  the  sorts  discussed  above,  and  allows  level-skipping  and  some  other 
potentially  useful  features. 

None  of  these  mechanisms  has  much  syntactic  flexibility,  and 
they  may  very  well  give  the  designer  a  feeling  of  great  rigidity  and 
constraint.  Two  dialogue-control  mechanisms  exist  which  are  much  more 
powerful  and  flexible  --  production  systems  and  language  grammars.  These 
control  mechanisms  are  also  much  more  sophisticated,  and  may  not  be  use- 
able,  in  raw  form,  by  more  than  a  small  percentage  of  designers. 

Production  systems  are  a  familiar  tool  in  artificial  intelligence, 
and  have  been  incorporated  as  the  overall  control  mechanism  for  the  Rand 
Intelligent  Terminal  Agent  (RITA;  Anderson  &  Gillogly,  1976).  Produc¬ 
tion  systems  are  very  powerful,  and  are  theoretically  capable  of  perform¬ 
ing  any  definable  function,  but  it  is  not  at  all  clear  that  they  are  a 
natural  mechanism  for  the  control  of  sophisticated  interactive  dialogues. 
RITA  is  impressive  in  that  it  can  execute  complex  procedures  for  the  user 
(e.g.,  logging  onto  the  ARPANET),  but  it  has  not,  to  our  knowledge, been 
made  to  interact  with  the  user  in  other  than  fairly  simple,  verbal  com¬ 
mands. 

Production  systems  are  a  relatively  specialized  and  difficult 
form  of  programming.  Very  few  interactive  system  designers  currently 
possess  the  skills  required  to  use  them  effectively.  It  is  likely,  how¬ 
ever,  that  interactive  aids  for  this  programming  task  may  be  forthcoming 
in  the  next  few  years.  It  remains  to  be  seen  whether  this  will  be  a 
viable  mechanism  for  the  control  of  "ordinary"  interactive  systems,  or 
only  a  tool  for  research  and  for  the  control  of  highly  specialized 
"knowledge-based"  systems. 
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Although  language  grammars  lack  the  power  of  production  systems 
for  manipulation  of  a  knowledge  base,  they  are  a  much  more  natural 
mechanism  for  control  of  interactive  dialogue.  The  use  of  a  grammar 
to  describe  the  syntax  of  a  command  language  or  programming  language 
is  familiar  to  most  modern  designers.  By  "augmenting"  such  a  grammar 
with  information  about  the  corresponding  system  actions,  the  grammar 
can  become  a  powerful  dialogue  control  mechanism  (see  Lenorovitz  & 

Ramsey,  1977). 

All  of  these  approaches  are  basically  verbal,  and  require  signi¬ 
ficant  extension  in  order  to  control  graphical  dialogues.  These  exten¬ 
sions  include,  at  the  least,  such  things  as  display  drivers  and  graphical 
input  processors.  For  greater  transportabil ity,  they  may  include  a 
device-independent  graphical  language  with  a  translator  for  the  specific 
display  and  input  devices.  This  approach  has  now  been  developed  to  the 
point  that  its  use  in  operational  systems  should  be  commomplace  in  a 
few  years.  Graphical  information  can  be  described  directly  with  grammars 
or  similar  means  (e.g.,  Carbonell,  1971),  but  cost-effective  operational 
•e  of  this  technique  seems  to  be  many  years  away. 


OUTPUT  DEVICES  AND  TECHNIQUES 


There  is  clearly  a  good  deal  of  interaction  between  the 
selection  of  a  dialogue  type,  design  of  a  command  language,  etc., 
and  the  selection  or  design  of  the  input/output  devices  which  will 
be  used  to  implement  that  dialogue.  Decisions  concerning  these 
topics  are  addressed  separately  here,  in  order  to  allow  a  manageable 
discussion  of  the  issues.  In  general,  decisions  concerning  impor¬ 
tant  aspects  of  the  dialogue  should  precede  input/output  device  de¬ 
cisions.  In  practice,  the  opposite  often  occurs,  and  we  have  seen 
several  instances  in  which  the  selected  terminal  devices  were  incap¬ 
able  of  conducting  a  dialogue  acceptable  for  the  task  at  hand. 

As  the  focus  shifts  from  dialogue  to  output  (display)  devices, 
the  available  empirical  data  which  can  be  used  to  support  design  guide¬ 
lines  becomes  both  more  voluminous  and  more  directly  relevant.  A 
great  deal  of  human  factors  research  has  been  done  on  displays  and 
display  devices,  and  this  research  has  concentrated  heavily  on  elec¬ 
tronic  displays  for  several  years.  For  some  design  issues  in  this  area 
(e.g.,  display  luminance,  contrast,  refresh  rate,  etc.,)  reasonable 
guidelines  already  exist  and  can  simply  be  incorporated  into  a  more  com¬ 
prehensive  design  guide.  There  remain  areas  in  which  guidelines  are 
still  inadequate  or  nonexistent. 

DISPLAY  TYPES 

One  such  area  is  the  selection  of  the  basic  type  of  display  de¬ 
vice  to  be  used  in  the  system.  Although  teletypewriters  and  alphanumeric 
cathode-ray-tube  displays  are  the  most  popular  choices,  numerous  other 
possibilities  exist,  including  plasma  displays;  LED  and  liquid  crystal 
displays;  tactile  displays;  audio  displays,  including  automated  speech 
generation;  graphical  displays,  including  three-dimensional  graphics; 
laser  displays;  and  even  psychophysiological  output  devices. 

Unfortunately,  there  has  been  little  human  factors  research  on 
overall  performance  with  most  of  these  display  types  (see  Figure  17). 
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Laser  displays  Reasonable  human  factors  guidelines  with  respect  to  visual  properties  Gould  &  Makous 
have  been  proposed,  but  these  displays  are  not  widely  used.  (1968) 


Noll  (1972) 

Landis  et  al 
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(1965)** 

1 

1 

CO  1 

#» 

4->  M-  3  "O 

-o 

U  O  +->  1  C 

-- — . 

cd 

1 

a)  S-  3  <T3  >, 

XJ 

CL 

o 

P-  CD  Or-  03 

O) 

O  1 

.X 

*4-  O  <+-  O  «  >1' — 

XJ 

c 

u 

<D  C  C  CO  ' — -  (T3  Q_ 

3 

CD  o 

>. 

4-  03  33  03  t/>  I—  in 

t— 

>  o 

to 

03  O  4->  S-  >,  CL-r- 

u 

<v 

CL 

U  to  tO  to  XJ 

c 

XJ 

• 

C  CO  *r-  •  CD  r—  -r- 

o 

tO 

-P>  _C 

to  CD  X  CO  JZ  CLT3  »— 

CJ 

C  -C 

3  O 

E  C71  03  l-  4->  CO  03 

CD  4-> 

XJ  i- 

L.  to  CD  -i-  C  3 

> 

to 

0  4JIUJ  OT3  4M3 

to 

0)  c 

e  (D 

P  CZ  (DP  CD  -r- 

(D 

(C 

5  to 

L  to  Pt  CD  L  > 

CL 

s_  x: 

O  CD 

CD  >  >  CD  >  CJ 

>> 

O  4-> 

c  s- 

CLTJ  XJ  3  r—  to  XJ 

i- 

(O  C  <—  T3  03  1  C 

-o 

CD  S- 

CD  to  f—  >  CD  *r- 

CD 

07  CD 

»—  O 

-c  c  to  cd  cn 

U 

tO  -X 

_o  P- 

+->•*-  tO  r-  4->  L-  C 

•r- 

O  4-J 

•r— 

03  03  S-  X3  XT  03  03 

> 

CL  O 

to  u 

o  E  s-  O  <c  air- 

CD 

o 

(O  •!— 

+->  03  t4-  >  -i-  c 

Q 

CD 

CD  Q- 

0)  CD  r-  CD  to 

• 

CL  C 

<4-  O 

4->  x:  >,  <u  -r-  x:  x: 

to 

>> 

O 

4-J 

uh  to  Er  *>p  p 

>, 

to 

C  X 

>> 

CD  1—  to  CJ  . 

to 

CD 

r—  to 

Q.  CL  CO  03  CO  <U  aj 

r— 

CL 

CD  C 

r— 

CO  •  CO  •  CJ  1 — 

CL 

to 

XJ  CD 

(O  >> 

<uto-i-ai4-><i)ccn 

to 

•r— 

CD 

CJ  1— 

S-  >,XJ  -C  O - 1—  C 

*r- 

Q 

CD  -0 

•r-  C 

to  4->  C  CO  to 

Q 

> 

c  o 

-c  s-  >, 

• 

U 

(T3  to 

-C 

4J  a  ai  >)  to  di  «>i — 

to 

r— 

•r— 

JC  to 

CJ  r- 

•r-  CO  COi —  -r-  O  (U  03 

r— 

to 

to 

JZ 

CD  f— 

3  -r-  1-  S-  o-  3 

to 

u 

to 

to 

P->  -r- 

o  to  top  o  ato 

c 

•r— 

co 

>>-c 

P-> 

CD  r-  CD  C  C  *r-  t- 

•r— 

-C 

03  CJ 

CO  to 

Ur-  r—  CD  -C  U  > 

CL 

r—  C- 

•r- 

C  (O  tO  U  4->  U  C 

Cl 

to 

• 

CL  (O 

CD 

CD  3  C  CD*r  L 

CD 

L. 

00  CD 

+->  S- 

T3X3CDtOOPi-CD 

s- 

h- 

CD 

r— 

•r*  CO  • 

3  tO 

•r—  *r-  L  t—  U  Q_  r— 

CD 

XJ  CD  to 

CL 

>  >  to  cn  r— 

to 

P- 

P- 

CD 

p  CJ 

c  to 

03  *r-  ^  >,  C  C  03 

3 

o 

O 

S- 

CD  •«- 

•r-  >, 

X3  to  u  to  E 

3 

r —  l/l  4-> 

to 

03  C  >>••-  r—  -M  to 

CD 

c 

c 

cn 

•r-  U  CD 

r—  i— 

c  •»-  to  x:  cl  to  cd 

o 

o 

4->  O  X= 

m  o. 

•r—  r—  ^  tO  *r-  r—  fQ 

P-> 

•r— 

•r— 

Li_ 

U  -P  -P 

O  to 

p  to  a  *r  x  a 

to 

to 

03  CJ  (O 

•r  *r* 

U  >  to  >,X3  CD  to  to 

O 

to 

to 

P  fD  o 

OlT) 

•r-  *r—  tO  >  XJ 

4-> 

3 

3 

4-  S- 

o 

»—  CL  XJ  r—  £_  4_  CD  C 

u 

U 

CD  CL 

r— 

P-3  CL  CD  O  *r-  CD 

CD 

to 

to 

E  c 

o  to 

c  o  c  to  jz  x: 

to 

•r- 

•r 

o  icr 

•r-  <J 

OV-03-I-C7)COOX1 

O 

XJ 

XJ 

</i  E  4-> 

to  »r 

U  cn  CD  XJ  •»-  P->  to  3 

3  •»- 

>,  cn 

IS-  X'r  c  CO 

u 

i- 

s- 

.c  x:  3 

x:  o 

CO  0J  O  03  E  3 

CD 

CD 

OI 

Q_  i — 

•r-  OilOr-  r.,-  >s  XJ 

4-> 

PJ 

3  03  "O 

o  o 

S-  1  CO  >>  r—  (D  r— 

CD 

(O 

to 

O  *—  CD 

JZ  -r- 

CD  (O  CD  C  r-  XJ  r- 

4-> 

r— 

r— 

x:  p  c 

CJ  to 

S-  r-  OVr-  CD  C  (O 

to 

PPL 

>>  >> 

03  S-  CO  4->  O  >,  3 

u 

CD 

CD 

i—  CD 

to  _c 

x:  Cf-  to  to  *r  ro  to 

o 

CD 

CD 

<£  r—  u 

CL  CL 

1—  O  ■—  03  C  +j  E  3 

(/> 

IS) 

1 

o 

c 

•P“ 

CD 

to 

CD 

>i 

i- 

CD 

r— 

to 

-C  to 

U  to 

C 

to  to 

<v  >, 

CLf—  >, 

(O  >, 

o 

U  >> 

»— 

o  to  to 

1  to 

-C 

•I—  to 

•i—  r— 

r  Ur- 

<D  f— 

CL 

x:  r- 

4J  CL 

U-r  a 

cn  cl 

CD 

CL  Q. 

CJ  CO 

>>  O)  to 

1-  to 

(O  to 

to  *r- 

1/1  O’r 

to  -r- 

CD 

L-  -r- 

h-  T3 

Q_  r—  XJ 

_l  XJ 

t— 

CD  XJ 

In  the  case  of  plasma  displays,  for  example,  the  only  relevant  research 
found  in  the  survey  was  concerned  with  punctate  versus  stroke  symbol 
generation.  This  topic  is  discussed  in  a  later  section.  Nothing  was 
found  on  light-emitting-diode  displays,  but  they  are  discussed  pri¬ 
marily  outside  the  computer  systems  literature.  Gould  and  Makous  (1968) 
provide  useful  recommendations  concerning  the  visual  properties  of  laser 
displays,  although  such  displays  have  not  lived  up  to  their  early  pro¬ 
mise  and  are  not  widely  used. 

Tactile  alphanumeric  displays  seem  to  hold  little  promise,  except 
possibly  for  blind  readers  (Shackel  &  Shipley,  1970),  but  tactile  dis¬ 
plays  have  been  proposed  for  "graphical"  information  (e.g.,  Noll,  1972). 
There  has  been  little  human  factors  research  on  such  displays,  at  least 
in  the  computer- related  literature.  Psychophysiological  displays  have 
been  the  subject  of  some  research,  including  even  the  possibility  of 
stimulation  devices  surgically  implanted  over  the  visual  cortex,  but 
no  application  of  even  the  less  esoteric  forms  appears  likely  anytime 
soon. 


Computer-generated  speech  is  not  yet  widely  used  for  real  appli¬ 
cation  work,  but  it  is  definitely  available  "off-the-shelf"  now.  Turn 
(1974)  presents  a  fairly  thorough  review  of  the  area,  including  problems 
and  possibilities.  Most  human  factors  research  and  guidelines  applicable 
to  auditory  displays  are  concerned  with  warning  signals  and  similar  matters, 
but  not  with  automated  speech.  Witten  and  Madams  (1977)  present  a  study 
which  suggests  that  even  relatively  low-quality  speech  is  acceptable  for 
casual  users.  Smith  and  Goodwin  (1970)  make  several  suggestions  for 
applying  automated  speech.  These  suggestions  concentrate  on  the  user's 
ability  to  pace  and  control  the  dialogue  and  on  the  use  of  different 
voices  and  tone  codes  to  convey  information  and  indicate  required  user 
actions. 


The  more  basic,  and  unresolved,  issue  concerns  the  recognition 
of  tasks  in  which  automated  speech  is  a  viable  alternative  to  visual 
displays.  Speech  is  characterized  by  low  bandwidth,  but  has  the 
advantage  that  visual  attention  is  not  required.  It  is  thus  useful 


when  the  user  has  some  visual  task,  and  is  certainly  good  for  warning 
signals  in  situations  in  which  operator  vigilance  is  suspect  because 
of  competing  tasks,  long  hours,  etc.  From  the  literature  surveyed,  it 
is  not  clear  that  any  more  extensive  guidance  can  be  offered  with  re¬ 
spect  to  the  use  or  nonuse  of  such  displays. 

Chapanis  (1975,  1976)  and  several  colleagues  have  reported  an 
extensive  research  program  in  which  two-person  teams  solved  problems 
while  communicating  in  various  media  (e.g.,  voice,  video,  teletype¬ 
writer,  handwriting).  A  consistent  finding  throughout  this  program  is 
that  problem  solving  is  more  effective  when  voice  communication  is 
allowed.  However,  it  is  not  clear  just  how  this  result  applies  to  inter¬ 
active  computer  systems.  Clearly,  most  of  the  advantage  is  achieved  by 
engaging  in  a  natural-language  dialogue  which  is  beyond  the  capabilities 
of  our  current  technology.  It  may  also  be  more  important  to  have  auto¬ 
mated  speech  input  than  output,  since  visual  displays  provide  a  very 
high  rate  of  information  transfer,  but  input  information  rates  tend  to 
be  relatively  low  with  virtually  all  input  media. 

Dialogue  recommendations  pertaining  to  speech  output,  such  as 
those  offered  by  Smith  and  Goodwin,  should  be  incorporated  into  a  de¬ 
sign  guide,  and  loose  guidelines  dealing  with  vocabulary,  phonetic  dis- 
criminability,  etc.,  could  probably  be  developed  based  on  existing 
knowledge.  Overall,  though,  the  literature  does  not  seem  to  provide 
a  great  deal  of  support  for  design  decisions  involving  speech  output. 

Teletypewriter  and  cathode-ray-tube  (CRT)  displays  have  been 
subjected  to  far  more  extensive  empirical  study.  Reasonable  guidelines 
exist  for  CRT  visual  properties,  including  character  generation  (see 
next  section).  Reasonable  guidelines  also  exist  with  respect  to  the 
physical  and  functional  properties  of  teletypewriter  terminals  (e.g., 
Dolotta,  1970;  see  later  section  on  Terminals).  Even  with  these  well- 
studied  display  devices,  though,  there  has  been  very  little  empirical 
study  which  is  concerned  directly  with  the  selection  of  one  display 
device  or  another.  Evans  (1976)  reports  that  medical  patients  prefer 
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a  slow  (10  cps),  noisy  teletype  to  a  CRT  terminal  for  interacting 
with  an  automated  medical  interviewing  system.  Martin  and  Chubb 
(1970)  report  that  users  in  a  logistics  management  task  found  CRTs 
much  less  boring  than  a  hardcopy  device,  but  no  performance  dif¬ 
ferences  were  reported.  Fetter  and  Carlisle  (1971)  compared  CRT 
and  teletype  terminals  for  use  in  information  retrieval  tasks  associ¬ 
ated  with  a  legal  information  system.  Users  took  longer,  made  more 
errors,  and  retrieved  slightly  more  information  when  using  the  CRT, 
but  the  CRT  is  said  to  be  poor  by  modern  standards. 

It  seems  unlikely  that  studies  at  this  level  will  provide  any 
satisfactory  resolution  to  the  device  selection  issue.  The  utility 
of  various  display  devices  is  fairly  task-specific,  and  probably 
cannot  be  resolved  in  such  a  grneral  way.  Guidelines  on  device  se¬ 
lection  will  probably  have  to  be  procedural  and  analytical  in  nature, 
helping  the  designer  to  understand  the  relevant  aspects  of  the  selected 
dialogue  method  and  the  user's  task(s)  and  requirements  for  displayed 
information.  In  some  cases,  such  an  analysis  will  clearly  eliminate 
some  candidate  display  devices.  In  most  cases,  though,  the  designer 
may  face  a  tradeoff  decision  involving  several  candidates.  At  present, 
the  best  we  can  expect  of  human  factors  guidelines  is  that  they  will 
inform  the  designer  of  the  relevant  issues. 

VISUAL  PROPERTIES  OF  DISPLAYS 

Reasonably  good  guidelines  already  exist  for  almost  all  of  the 
basic  visual  properties  of  refreshed  CRT  displays  (see  Figure  18)  and 
—  by  extension  --  of  storage-tube  and  plasma  displays,  although  care 
is  obviously  required  in  making  this  generalization.  The  phrase  "basic 
visual  properties"  is  used  here  to  connote  those  simple  display  pro¬ 
perties,  such  as  luminance  and  contrast,  which  are  more  or  less  inde¬ 
pendent  of  informat  _,n  content  and  format.  Although  there  is  consider¬ 
able  research  dealing  with  these  properties,  it  is  important  to  note 
that  older  studies  of  displays  illuminated  by  reflected  light  are  not 
particularly  relevant  here. 
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Flicker  Perceptible  display  flicker  can  cause  irritation  and  visual  fatigue,  Dill  &  Gould  (1970) 

and  may  impair  visual  performance.  Flicker  is  a  function  of  regener-  Gould  (1968)* 

ation  rate,  phosphor  persistence  and  chromaticity,  and  size  of  dis¬ 
play  element.  Beam  scanning  sequence  (as  in  interlaced  raster  scan- 
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Figure  18.  Basic  Visual  Properties  of  Displays 


Flicker 
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Perceptible  display  flicker  can  result  in  irritation  and 
fatigue,  as  well  as  perceptual  problems  (Oestberg,  1975).  In  dis¬ 
plays  which  are  refreshed  at  a  constant  rate  regardless  of  content 
(e.g.,  standard  raster  displays),  flicker  is  basically  a  hardware 
design  issue.  In  refreshed  vector  graphical  devices,  display  ele¬ 
ment  type  and  density  affect  display  writing  time,  which  can  affect 
regeneration  rate  and  thus  cause  flicker.  Flicker  may  be  a  hardware 
or  software  issue  here. 

Fairly  good  quantitative  information  exists  relating  flicker 
to  regeneration  rate,  angular  size  of  display  element,  luminance, 
and  phosphor  persistance  and  spectral  composition  (Gould,  1968). 

Dill  and  Gould  (1970)  supplement  this  information  and  investigate 
the  effect  of  beam-scanning  sequence,  which  turned  out  to  be  smaller 
than  expected.  Good  guidelines  exist  with  respect  to  the  regeneration 
rate  required  to  avoid  flicker  with  various  phosphors,  and  good  em¬ 
pirical  methods  of  assessing  the  acceptability  of  a  particular  combina¬ 
tion  of  values  are  available. 

Luminance  and  Luminance  Contrast 

Gould  (1968)  provides  a  good  review  of  the  factors  affecting 
luminance  and  luminance  contrast.  Luminance  contrast  is  also  dis¬ 
cussed  fairly  well  by  Bryden  (1969).  Gould's  recommendations  concern 
ing  these  factors  appear  sound.  Oestberg  (1975)  and  Stocker  (1964) 
point  out  that  office  and  background  illumination  affect  desirable 
display  luminance  values.  Oestberg,  as  well  as  Hultgren  and  Knave 
(1974),  feel  that  "contrast  glare"  due  to  use  of  terminals  in  bright 
offices  is  an  important  factor  in  operator  eyestrain  and  fatigue. 
Hultgren  and  Knave  present  simple  guidelines  for  dealing  with  this 
problem,  and  Stocker  suggests  a  method  for  arriving  at  appropriate 
display  aid  background  illumination  levels  when  the  user  must  work 
with  both  displays  and  hardcopy.  Mezrich  et  al  (1977)  point  out  that 


- - r . 


luminance  interacts  with  chrominance  (hue)  because  of  the  nonlinear 
spectral  sensitivity  of  the  visual  system.  They  present  a  quanti¬ 
tative  model  which  could  probably  be  useful  in  correcting  luminance 
levels  for  changes  in  phosphor  chrominance.  Empirical  evaluation 
is  preferable,  however,  when  a  new  display  device  is  being  developed. 

Chromaticity 

Chromaticity  is  probably  not  an  important  issue  in  monocolor 
displays,  as  long  as  luminance  is  adequate  (Gould,  1968).  Mezrich 
et  al  (1977)  present  a  model  of  the  relationship  between  these  factors. 
Although  the  greatest  sensitivity  of  the  eye  is  in  the  yellow-green 
region,  most  alphanumeric  CRT  displays  produced  in  the  U.S.  are  white. 
In  Europe,  there  is  a  tendency  to  move  toward  yellow  on  the  theory 
that  it  reduces  eyestrain,  but  we  are  unaware  of  empirical  evidence 
specifically  supporting  this  decision. 

Resolution 


Most  studies  of  display  resolution  have  dealt  with  alphanumeric 
or  geometric  symbols  and  have  used  either  standard  visual  acuity  mea¬ 
surement  techniques  or  visual  search  tasks.  The  use  of  measurement 
techniques  more  nearly  similar  to  those  which  will  be  performed  by 
the  user  may  be  more  appropriate.  For  example,  Smith  and  Goodwin  (1973) 
propose  a  check-reading  task  in  which  the  reader  searches  for  random 
letter  substitutions  in  connected  text.  This  technique  is  applicable 
to  the  measurement  of  legibility  as  a  function  of  any  display  parameters, 
but  is  mentioned  here  because  such  techniques  are  most  often  applied 
to  studies  of  resolution. 

Reasonable  guidelines  exist  for  the  most  common  questions  of 
display  resolution  (e.g.,  number  of  scan  lines  or  dots  per  character). 
Even  here,  reasonable  discretion  is  required  in  generalizing  to  tasks 
other  than  those  studied,  and  in  correcting  for  effect  of  variations 
in  other  display  variables.  More  detailed  guidelines  providing  for 
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such  factors  could  be  developed,  based  on  existing  information. 

They  would  necessarily  have  gaps,  and  should  in  any  case  be  supple¬ 
mented  with  empirical  testing  of  the  legibility  of  the  particular 
device. 

Properties  of  Alphanumeric  Characters 

A  very  large  number  of  studies  have  been  done  pertaining  to 
the  properties  of  alphanumeric  displays.  Most  of  the  studies  do  not 
deal  with  electronic  displays,  and  were  outside  the  scope  of  this  sur¬ 
vey  even  though  they  may  be  relevant  to  the  design  of  such  displays. 

It  is  worth  noting,  however,  that  the  visual  properties  of  luminous 
displays  are  somewhat  different  from  those  of  hardcopy  displays  illum¬ 
inated  only  by  reflected  light. 

Important  properties  of  alphanumeric  characters  include  generation 
method  (raster,  dot,  stroke),  font  (character  shape  and  number  of  ele¬ 
ments),  size,  case  (upper  case  only,  or  mixed  upper  and  lower),  spacing, 
and  aspect  ratio  (ratio  of  height  to  width).  These  variables  are  dis¬ 
cussed  in  Figure  19. 

The  best  overview  of  alphanumeric  display  properties  discovered 
in  the  survey  is  Buckler  (1977).  This  paper  contains  a  well  integrated 
set  of  general  (not  task-specific)  guidelines  for  the  variables  men¬ 
tioned  above  and  others. 

DISPLAY  CODING  TECHNIQUES 

A  large  number  of  empirical  studies  have  been  done  relating  to 
alternative  display  coding  techniques  (color  coding,  blink  coding,  etc.). 
Most  of  these  studies  involve  a  comparison  of  two  or  more  different 
coding  techniques  in  the  context  of  a  particular  task.  This  informa¬ 
tion  may  be  helpful  in  selecting  a  coding  technique.  Few  of  the  stu¬ 
dies  seriously  attempt  to  help  the  designer  decide  exactly  how  to 
apply  the  chosen  coding  technique,  however. 
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Figure  19.  Display  Properties  of  Alphanumeric  Characters 


Many  of  the  studies  are  relatively  old,  and  address  coding 
in  other  contexts  than  computer  displays.  Their  generalization 
requires  some  care,  since  existing  computer  displays  involve  images 
which  are  composed  of  (sometimes  perceptible)  dots  or  lines  rather 
than  continuous  areas  of  constant  hue,  brightness,  etc. 

Nonetheless,  there  is  a  wealth  of  relevant  information  on 
this  topic  which  can  reasonably  be  integrated  into  a  design  guide. 

In  the  case  of  color  and  blink  coding,  the  literature  included  in 
the  survey  is  probably  adequate  for  this  purpose.  In  other  cases, 
reference  to  the  older,  non-computer-specific  literature  would  be 
required.  Figure  20  outlines  some  major  conclusions  with  respect 
to  the  four  most  common  coding  techniques. 

Alphanumeric  Coding 

The  survey  turned  up  very  little  information  on  alphanumeric 
coding  which  is  either  recent  or  specific  to  computer  displays.  Older 
studies  referenced  by  Christ  (1975)  and  by  Grether  and  Baker  (1972) 
indicate  that  alphanumeric  coding  is  the  most  accurate  technique  for 
identification  tasks,  and  may  be  tolerable  for  search  tasks.  It 
also  has  the  advantage  of  allowing  a  practically  unlimited  number  of 
coding  categories.  The  older  studies  contain  performance  data  for 
search  time,  etc.  There  are  several  specific  issues  to  be  addressed 
if  alphanumeric  coding  is  to  be  used  satisfactorily,  as  indicated 
in  the  figure. 

Shape  Coding 

Shape  coding,  as  by  the  use  of  geometric  symbols,  is  widely 
used,  particularly  in  graphical  displays.  Because  the  symbols  are 
often  designed  for  a  particular  application,  performance  data  are 
usually  not  generalizable.  Research  does  indicate  that  shape  coding 
is  useable  for  visual  search  and  identification  tasks,  but  that  other 
coding  methods  yield  better  performance.  Grether  and  Baker  (1972) 
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No  research  is  known  on  optimal  on-off  cycle. 


recommend  that  not  more  than  15  symbols  (preferably  not  more  than 
5)  be  used. 

It  should  be  noted,  however,  that  most  studies  of  shape 
coding  have  involved  abstract  tasks,  without  an  underlying,  meaning¬ 
ful  application,  and  symbol  sets  constructed  primarily  for  visual 
discriminability.  That  is,  the  users  had  no  prior  association  of 
the  symbols  with  task-relevant  entities. 

In  those  situations  in  which  the  association  of  symbols  to 
targets  is  not  intended  to  be  meaningful,  the  data  derived  from 
earlier  experiments  are  probably  applicable.  Furthermore,  there 
exist  standard  "alphabets"  of  symbols,  already  optimized  for  dis¬ 
criminability,  which  can  be  recommended  to  the  designer.  Some 
care  is  required  in  applying  these  symbol  sets  to  computer  displays 
as  the  symbol  generation  method  (vector,  dot,  raster)  may  influence 
the  discriminability  of  the  symbols. 

There  are  many  situations,  though,  in  which  the  user's  prior 
association  of  symbols  with  meanings  is  a  primary  advantage  of  shape 
coding.  In  some  cases,  standard  symbol  sets  are  adapted  for  use  on 
the  system.  For  example,  there  are  several  relatively  standard  sets 
of  military  unit  symbols  used  in  tactical  displays  at  various  echelons. 
In  other  cases,  no  pre-existing  symbol  set  is  used,  but  symbols  are 
designed  whose  pictorial  properties  suggest  their  meaning.  In  these 
situations,  the  data  from  classical  research  studies  are  of  limited 
assistance.  Particularly  in  identification  tasks,  it  is  likely  that 
many  more  code  levels  are  possible  here  than  with  meaningless  symbols. 

A  design  guide  should  perhaps  make  suggestions  which  will  assist  the 
designer  in  maximizing  discriminability  of  such  symbols  without  losing 
symbol  recognizability. 

Jacob  et  al  (1976)  have  shown  that  the  use  of  meaningful  sym¬ 
bols  may  be  advantageous  even  in  situations  in  which  symbol  meaning 
does  not  correspond  to  the  properties  of  the  target.  By  using  symbolic 


faces  with  various  facial  expressions,  instead  of  geometric  shapes 
or  several  other  codes ,  they  have  demonstrated  enhanced  performance 
on  both  sorting  and  paired-associates  tasks. 

Color  Coding 

Like  many  aspects  of  interactive  system  design,  the  effects 
of  color  coding  vary  with  a  number  of  other  task  and  display  factors. 

Ir  the  case  of  color  coding,  however,  there  have  been  enough  studies 
that  we  can  begin  to  see  the  pattern  of  these  interactions.  Christ's 
(1975)  excellent  review  of  these  studies  identifies  several  task  and 
display  coding  factors  which  interact  with  color. 

In  general,  relevant  color  coding  —  redundant  or  nonredundant  — 
yields  better  performance  than  other  static  achromatic  coding  techniques 
in  visual  search  tasks.  In  this  context,  "relevant"  implies  not  only 
logical  relevance,  but  also  prior  knowledge,  by  the  user,  of  the  color 
of  the  target  item.  When  the  task  is  to  identify  the  target,  identi¬ 
fication  by  alphanumeric  code  is  more  accurate,  but  color  is  preferable 
to  other  static  achromatic  coding  dimensions.  Performance  advantages 
are  quite  small  in  many  of  these  situations,  however,  and  may  be  negated 
by  interference  with  other  coding  dimensions,  as  well  as  cost  and 
other  factors  discussed  below.  When  targets  are  color  coded  in  a  way 
that  is  irrelevant  to  the  current  task  --  as  might  easily  happen  when 
different  coding  methods  are  used  to  signify  different  target  pro¬ 
perties  --  both  search  and  identification  performance  are  degraded. 
Christ's  review  contains  quantitative  data  on  performance  effects 
which  might  be  useful  to  the  designer  in  making  tradeoff  decisions 
among  coding  dimensions,  especially  when  more  than  one  coding  dimen¬ 
sion  is  to  be  used. 

Teichner  et  al  (1977)  present  a  very  good  analysis  of  color 
coding,  and  suggest  that  it  is  most  effective  when:  "(1)  designating 
a  specific  target  in  a  crowded  display,  (2)  demarcating  an  area  of 
a  display,  (3)  providing  warning  signals  or  commands  which  have  a 
limited  number  of  possible  alternatives,  and  (4)  classifying  or 
grouping  data  where  the  number  of  classifications  are  small." 

116 


Grether  and  Baker  (1972)  provide  guidance  with  respect  to 
the  number  of  hues  (maximum  10,  recommended  3),  and  specific  hues, 
which  might  be  appropriate  for  color  coding.  Haeusing's  (1976) 
data  deal  particularly  with  electronic  raster-scan  displays,  and 
are  probably  more  reliable  in  this  context  than  the  older  studies 
used  by  Grether  and  Baker,  but  Haeusing  deals  only  with  absolute 
color  judgment  tasks.  Situations  in  which  colored  symbols  may 
overlap  (yielding  different  hues)  are  treated  by  Smith  (1963). 

However,  there  are  a  number  of  factors  other  than  simple 
psychophysical  data  which  might  influence  these  choices.  Some 
color  displays  have  a  restricted  color  spectrum.  Beam  penetration 
color  CRT  displays,  for  example,  are  presently  restricted  to  hues 
ranging  from  red  to  a  somewhat  yellowish  green.  For  situations  in 
which  spatial  position  is  important,  these  displays  may  also  have 
rather  critical  calibration  problems.  Chromatic  aberration  can  even 
induce  undesired  three-dimensional  effects  which  influence  the  user's 
perception  of  spatial  position.  Existing  vector  graphic  color  dis¬ 
plays  are  generally  rather  limited  in  display  capacity,  while  raster 
displays  may  have  visual  resolution  problems.  The  decision  to  use 
color,  then,  can  have  a  variety  of  side  effects. 

There  is  evidence  which  suggests  that  users  have  a  rather 
strong  preference  for  color  displays,  even  when  the  use  of  color 
carries  no  performance  benefits.  This  may  lead  to  an  overuse  of 
color  displays  in  situations  in  which  the  use  of  color  coding  to 
aid  in  search  or  identification  is  unnecessary.  There  are  no  known 
data  which  would  help  in  weighing  possible  long-term  motivational 
benefits  of  color  displays  against  cost  and  other  disadvantages 
in  such  areas.  At  the  very  least,  though,  such  possible  motivational 
benefits  are  somewhat  speculative. 


Blink  Codinc 


Smith  and  Goodwin  (1971)  investigated  the  use  of  blink  coding 
in  a  target  detection  task.  They  found  a  50%  improvement  in  search 
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times  when  all  items  of  the  target  class  were  blinked  at  a  3-Hz 
rate.  When  all  nontargets  were  blinked,  performance  improved  almost 
as  well.  In  a  later  experiment  involving  a  check-reading  task  (Smith 
&  Goodwin,  1972),  reading  performance  was  found  to  be  slightly  de¬ 
graded.  The  authors  suggest  that  avoidance  of  such  degradation  prob¬ 
ably  depends  on  the  user's  ability  to  adapt  the  visual  scan  rate 
to  the  blink  rate  of  the  system.  This  depends  on  the  task,  and  might 
suggest  that  the  blink  rate  should  be  determined  in  part  by  an  analysis 
of  the  "natural"  scan  rate  of  the  task. 

Although  there  is  evidence  that  users  can  discriminate  up  to 
four  different  blink  rates  (Cohen  &  Dinnerstein,  1958),  we  believe 
that  blink  coding  should  normally  be  restricted  to  two  categories 
(one  non-blinking)  with  the  possible  addition  of  a  third  for  some 
special  purpose  (e.g.,  a  cursor).  Multiple  blink  rates  on  a  dis¬ 
play  can  be  irritating,  and  perhaps  fatiguing,  if  not  directly  dis¬ 
ruptive  of  performance. 

For  simple  visual  search  tasks  with  only  one  item  or  item 
class  blinking,  a  blink  rate  of  3-4  Hz  is  probably  best  (see  Smith  & 
Goodwin,  1971;  Vartabedian,  1970;  for  indirect  evidence).  No  re¬ 
search  was  found  dealing  with  on-time  versus  off-time  in  the  blink 
cycle,  but  on-time  should  probably  be  50-60%  of  the  cycle. 

Other  Coding  Techniques 

Several  other  coding  techniques  exist  which  are  less  frequently 
used  than  those  discussed  above.  These  include; 

Size 

Depth 

Line  type  (solid,  dashed,  etc.) 

Brightness,  line  width 

Motion 

Focus  or  distortion 


Guidelines  for  the  use  of  these  coding  techniques  can  suggest  the 
maximum  number  of  coding  levels  and  point  out  interactions  (e.g.. 


brightness  and  line  width  should  not  be  used  together  on  CRT  dis¬ 
plays).  In  general,  though, the  only  empirical  data  available  on 
these  techniques  are  simple  psychophysical  data  on  recognition 
and  discrimination.  There  is  little  research  bearing  on  the  appli¬ 
cation  of  these  coding  methods  to  interactive  systems. 

INFORMATIONAL  PROPERTIES  OF  DISPLAYS 
Display  Density 

The  amount  of  information,  or  density  of  information,  which 
a  display  should  have  involves  both  perceptual  and  cognitive  issues. 
It  has  been  shown  repeatedly  that  increasing  the  number  of  displayed 
items  (relevant  or  irrelevant)  on  a  display  increases  time  and  errors 
in  such  tasks  as  visual  search,  counting,  noting  of  display  changes, 
and  extracting  information  from  changed  items  (Baker  et  al,  1960; 
Bowen  et  al,  1963;  Callao  et  al  1977;  Coffey,  1961;  Earl  &  Goff, 

1965;  Green  1953;  Poulton,  1968;  Ringel,  1969;  Ringel  &  Hammer 
1964;  Ringel  &  Vicino,  1964;  Schutz,  1961a).  These  studies  provide 
no  "optimal  number  of  display  elements",  since  that  is  highly  task- 
specific,  but  they  do  provide  data  relating  time  and  error  perfor¬ 
mance  to  the  number  of  displayed  items,  especially  for  visual  search 
tasks.  They  also  suggest  system  design  actions  which  can  help  alle¬ 
viate  problems  of  high  display  density. 

Obviously,  differential  coding  of  relevant  and  irrelevant 
classes  of  display  items  can  help  (see  especially  Ringel,  1969). 

More  to  the  point,  though,  is  the  minimization  of  the  number  of  items 
which  are  coded  with  the  relevant  class  code  (see,  for  example, 
Poulton,  1968).  Reduction  in  number,  or  total  elimination,  of  mem¬ 
bers  of  irrelevant  classes  can  also  help  significantly.  Stewart 
et  al  (1974)  suggest  that  the  user  should  have  the  capability  of 
manually,  and  reversibly,  eliminating  such  items  from  the  display. 


It  is  also  clear  that  people  have  subjective  preferences 
with  respect  to  the  amount  of  information  present  in  a  display, 
with  subjective  ratings  typically  declining  as  the  amount  of 
information  deviates  from  the  preferred  amount,  in  either  direc¬ 
tion  (e.g.,  Vitz,  1966). 

Aside  from  purely  perceptual  problems,  a  high  density  of 
displayed  information  can  also  cause  a  "cognitive  overload",  with 
effects  not  only  on  speed,  but  also  on  the  quality  of  the  user's 
information-integration  and  decision-making  performance.  Dorris 
et  al  (1977)  varied  both  the  number  of  "data  sources"  in  a  display 
and  their  informational  value  (in  the  sense  of  correlating  with  a 
criterion  measure)  and  assessed  the  ability  of  people  to  integrate 
the  information  from  the  various  sources  in  order  to  predict  the 
criterion  value.  Performance  improved  as  the  number  of  data  sources 
increased  from  two  to  five,  and  then  dropped  off  when  eight  data 
sources  were  displayed.  Perhaps  more  important,  performance  was 
uniformly  poor  when  the  data  sources  were  uncorrelated  with  one 
another.  Like  many  other  studies,  this  experiment  illustrated 
the  rather  sharp  limits  of  human  ability  to  integrate  information 
from  a  variety  of  unrelated  sources.  Unfortunately,  guidelines 
on  this  topic  are  not  easy  to  generate,  and  we  do  not  immediately 
see  how  to  do  so.  In  complex  cognitive  tasks,  a  proper  task  analysis 
and  consideration  of  appropriate  problem-solving  aids  --  both  of 
which  were  discussed  earlier  --  may  help.  Whether  or  not  guidelines 
at  the  level  of  display  formatting  will  help  is  an  unresolved  issue. 

Display  Formatting  in  General 

Standard  human  factors  textbooks  and  guidelines  contain 
a  great  deal  of  material  dealing  with  displays.  Only  a  portion  of 
this  material  is  applicable  to  typical  computer  displays,  however. 

For  example,  Grether  and  Baker's  (1972)  very  good  overview  of  human 
factors  issues  in  displays  deals  largely  with  analog  displays  such 
as  those  in  aircraft,  but  has  little  information  on  either  textual 
or  graphical  displays.  Although  there  is  a  good  deal  of  relevant 
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research,  no  extensive  guidelines  appear  to  have  been  developed 
for  displays  of  this  sort.  Engel  and  Granda's  (1975)  presentation 
is  excellent,  but  has  many  gaps,  and  Danchak's  (1976)  discussion 
is  nicely  popularized,  but  sketchy  and,  in  places,  unsound. 

Stewart  (1976)  provides  a  brief  overview  of  several  basic 
principles  which  the  display  designer  should  keep  in  mind.  These 
principles  include  logical  sequencing,  spaciousness,  relevance, 
consistency,  grouping,  and  simplicity.  An  example  is  provided 
which  illustrates  the  nature  and  benefits  of  these  principles 
rather  well.  It  is  not  entirely  clear  that  this  is  sufficient 
information  to  allow  the  designer  to  successfully  apply  the  princi¬ 
ples,  however.  Certainly  some  such  discussion,  which  provides  a 
conceptual  framework  for  more  specific  guidelines  (e.g.,  Engel  & 

Granda)  is  appropriate  in  a  design  guide.  It  may  be  that  textbooks 
on  graphic  arts  and  presentation  media  are  the  best  existing  source 
for  such  a  high-level  introduction,  since  psychological  research  has 
not  yet  succeeded  in  producing  a  unifying  conceptual  framework  in 
this  area,  as  will  be  seen. 

Several  attempts  have  been  made  to  determine  what  aspects 
of  displays  are  of  primary  psychological  importance.  A  successful 
effort  here  could  provide  a  very  useful  conceptual  framework  around 
which  to  orient  both  display  formatting  guidelines  and  display  re¬ 
search.  Thus  far,  the  efforts  have  not  been  totally  satisfactory, 
though.  The  best  of  these  efforts  is  probably  the  factor-analytic 
study  of  Siegel  and  Fischl  (1971)  and  the  resulting  "Analytic  Pro¬ 
file  System"  (Siegel  et  al,  1969).  This  effort  identified  seven 
subjectively  relevant  dimensions  of  the  displays  used  in  the  study. 

The  dimensions  were:  stimulus  numerosity,  primary  coding,  contextual 
discrimination,  structure  scanning,  critical  relationships,  cue  inte¬ 
gration,  and  cognitive  processing.  There  are  a  number  of  methodological 
criticisms  of  this  study  which  make  these  factors  somewhat  suspect,  and 
they  are,  as  stated,  based  on  subjective  measures  rather  than  perfor¬ 
mance.  They  have  good  face  validity,  though,  and  are  well  explained. 
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Landis  et  al  (1967)  attempted  to  use  a  multiple  linear  re¬ 
gression  to  relate  display  and  operator  properties  to  operator 
"decision  quality".  Their  attempt  has  the  advantage  that  it  is 
based  on  performance,  rather  than  subjective  ratings.  They 
found  a  number  of  specific  relationships  between  display  properties 
and  performance  (e.g.,  small  displays  were  better  than  large,  non- 
redundant  color  coding  helped,  etc.).  Their  findings  are  probably 
quite  task-specific,  however. 

Given  the  present  stage  of  development  of  such  studies,  a  generic 
framework  does  not  exist  which  can  encompass  a  significant  variety  of 
display  or  display  element  types  in  a  way  useful  to  the  designer.  Thus, 
the  guidelines  must  either  provide  only  a  general  overview  or  must 
provide  a  large  number  of  fairly  situation-specific  suggestions.  In¬ 
deed,  it  may  be  impossible  to  avoid  the  latter  even  with  a  good  frame¬ 
work,  since  many  guidelines  (e.g.,  display  of  dates  and  times)  may  be 
matters  of  population  stereotype  rather  than  good  generic  principles. 

Two  other  aspects  of  display  formatting  are  largely  indepen¬ 
dent  of  detailed  display  type,  and  should  be  covered  prior  to  a 
discussion  of  individual  display  types.  The  first  is  display  part¬ 
itioning,  which  involves  the  sharing  of  a  single  display  surface 
by  several  display  areas,  usually  segregated  by  lines.  Although 
this  is  a  common  practice  which  is  clearly  advantageous  in  many 
cases,  no  research  is  known  pertaining  to  methods  of  partitioning, 
number  of  areas,  etc.  Miller  and  Thomas  (1976)  make  some  reasonable 
suggestions  for  use  of  various  partitions.  Martin  (1973)  shows 
numerous  examples  of  partitioned  displays. 

In  the  event  that  the  user  must  have  access  to  more  informa¬ 
tion  than  can  reasonably  be  presented  at  one  time,  the  display  may 
provide  access  to  a  selectable  portion  of  the  information  (as  by 
windowing  or  scrolling),  or  the  display  may  be  time-shared  so  that 
only  a  portion  of  the  information  is  displayed  at  once,  with  the  user 
able  to  switch  the  display  back  and  forth  in  order  to  see  it  all. 


Although  these  techniques  are  in  common  use,  little  research  has 
been  done  on  them. 

In  windowing,  the  display  is  used  as  a  "window"  through 
which  the  user  can  view  a  moveable  portion  of  the  display  space. 
Windowing  can  be  used  with  graphical  or  tabular  displays,  as  well 
as  alphanumeric  displays  which  convey  spatial  information  (e.g.,  some 
air-traffic-control  displays).  Typically,  the  user  has  left,  right, 
up,  and  down  controls  with  which  to  move  the  window  or  the  display. 
Granda  (1978)  has  demonstrated  the  feasibility  of  windowing  in  a 
visual  search  task,  but  was  unable  to  resolve  the  "inside/outside" 
problem  of  whether  the  direction  of  the  user's  controls  should  cor¬ 
respond  to  the  motion  of  the  window  or  of  the  underlying  display. 

Our  very  informal  study  of  this  problem  suggests  that  pilots  and  non¬ 
pilots  have  opposite  stereotypical  responses  in  this  situation,  but 
we  have  not  performed  a  controlled  experiment. 

Scrolling  is  similar  to  windowing,  but  involves  only  text, 
and  only  vertical  motion  of  the  window.  No  research  was  found 
dealing  with  scrolling.  Giddings  (1972)  presents  an  interesting 
discussion  of  the  time-sharing  of  displays,  and  makes  suggestions 
with  respect  to  determining  what  information  can  be  time-shared, 
preventing  the  loss  of  positionally  coded  information,  and  provid¬ 
ing  appropriate  means  for  operator  control  of  time-shared  displays. 

It  is  fairly  typical  for  sophisticated  graphical  applications 
to  involve  partitioning,  windowing,  and  time-sharing,  and  these 
features  can  interact.  A  discussion  of  these  topics  in  a  design 
guide  should,  therefore,  be  integrated.  At  present,  though,  the 
content  of  such  a  discussion  will  depend  more  on  art  than  on  science. 

Graphical  versus  Nongraphical  Displays 

In  many  instances,  the  designer  has  a  choice  between  pre¬ 
senting  information  in  graphical  form  and  presenting  it  in  tabular 
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or  other  alphanumeric  form.  Although  the  merits  of  these  different 
approaches  are  by  no  means  settled,  there  are  a  few  studies  which  bear 
on  this  decision.  Important  factors  are  the  relative  importance  of 
speed  vs  accuracy,  whether  or  not  the  user  is  required  to  recall  in¬ 
formation  from  the  display  after  it  has  been  removed,  and,  of  course, 
the  nature  of  the  information  to  be  conveyed. 

There  is  evidence  that  information  which  is  spatially  encoded 
on  the  display  can  be  recalled  with  greater  speed  and  accuracy  than 
information  which  is  alphabetically  encoded  (Howell  &  Tate,  1966; 

Newman  &  Davis,  1963),  although  a  study  by  Nawrocki  (1972)  failed  to 
substantiate  this  finding.  There  are  many  task  variables  not  explicitly 
considered  in  these  studies,  and  not  enough  research  has  been  done  to 
allow  a  firm  conclusion.  There  is,  of  course,  a  large  literature  deal¬ 
ing  with  verbal  versus  nonverbal  encoding  of  information  in  human 
memory ,  and  that  literature  was  not  included  in  this  survey,  nor  is  that 
literature  wel 1 -integrated  (or  even  consistent). 

Wright  and  Reid  (1973)  compared  flowcharts,  decision  tables, 
prose,  and  "short  sentence"  descriptions  of  a  simple  algorithm  in  an 
experiment  in  which  participants  hand-executed  the  algorithm  and 
memorized  it.  The  flowchart  --  which  might  be  correctly  viewed  as 
a  graphical  display  --  was  superior  for  hand  execution,  but  was  not 
conducive  to  memorization.  The  "short  sentence"  form,  a  structured 
verbal  presentation  similar  to  a  program  design  language,  was  best 
for  memorization.  It  is  not  entirely  clear,  though,  that  a  flowchart 
should  be  regarded  as  a  graphical  in  the  same  sense  that  cartographical, 
pictorial,  and  Cartesian-graph  displays  are  graphical. 

A  little  more  information  is  available  for  those  situations 
in  which  the  user  extracts  information  from  the  display  while  it  is 
still  present.  When  the  information  to  be  displayed  is  a  mathemati¬ 
cal  function,  it  would  appear  that  graphs  allow  more  rapid  extrac¬ 
tion  of  information  about  individual  values  of  the  function,  but  tables 
allow  equal  accuracy  (Cropper,  1968)  or  greater  accuracy  (Carter, 
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1947,  1948;  Hoisman  et  al ,  1963).  The  accuracy  question  is  greatly 
affected  by  the  need  for  interpolation.  This  varies  with  the  task, 
size  of  the  table,  and  --  for  functions  involving  three  or  more 
variables  --  the  number  of  lines  on  the  graph.  If  also  seems  clear 
that  graphical  presentations  are  preferable  if  the  user's  task  is 
to  understand  the  basic  form  of  the  functional  relationship  without 
regard  to  quantitative  accuracy. 

Booher  (1975)  compared  the  effects  of  pictorial  and  verbal 
information  when  the  user's  task  was  to  execute  a  procedure  involving 
a  series  of  discrete  actions  in  the  use  of  electronic  equipment. 

Speed  of  execution  of  the  procedure  was  greater  when  the  procedure 
was  described  pictorial ly,  but  a  verbal  description  of  the  procedure 
resulted  in  greater  accuracy.  At  a  superficial  level,  this  result 
appears  similar  to  those  discussed  above.  It  is  likely,  though, 
that  a  detailed  task  analysis  would  show  that  the  pictorial  informa¬ 
tion  aided  in  some  subtasks  (e.g.,  locating  a  switch  or  dial),  while 
the  verbal  information  was  more  helpful  in  others  (e.g.,  achieving 
the  correct  dial  setting). 

Vicino  and  Ringel  (1966)  studied  the  effects  of  graphical 
and  alphanumeric  presentations  of  tactical  information  used  by 
decision  makers  to  recognize  that  enemy  forces  were  massing  for 
attack.  There  was  no  difference  in  decision  quality  between  the 
two  presentation  forms.  It  should  be  noted,  however,  that  the  study 
used  personnel  unfamiliar  with  this  task  and  with  the  symbols  in¬ 
volved.  This  result  might  very  well  be  different  with  experienced 
users. 


Overall,  the  research  literature  dealing  with  the  graphical- 
nongraphical  display  question  is  disappointing.  Not  only  is  more 
research  needed  before  extensive  guidance  can  be  confidently  pro¬ 
vided,  but  that  research  must  be  much  better  integrated  than  it 
has  been  in  the  past.  Otherwise,  variations  in  task  and  type  of 
information  to  be  displayed  will  render  the  guidelines  inapplicable 
to  most  design  decisions. 
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Formatting  of  Tabular  Displays 

The  information  available  with  respect  to  the  formatting 
of  tabular  displays  does  not  seem  to  have  changed  much  since 
Stewart  et  al's  (1974)  survey.  Most  of  the  studies  in  this  area 
did  not  deal  explicitly  with  computer  displays,  and  were,  there¬ 
fore,  excluded  from  this  survey.  We  are  aware  of  them  from  second¬ 
ary  sources,  however.  The  available  empirical  information  is 
sketchy,  and  it  may  be  that  the  best  source  of  guideline  material 
is  still  to  be  found  in  the  existing  guidelines  for  graphic  design 
used  by  artists,  publishers,  etc. 

Several  investigators  have  studied  the  differential  effects 
of  horizontal  and  vertical  arrangements  of  lists.  When  no  recall 
was  required,  Coffee  (1961)  and  Earl  and  Goff  (1965)  found  no 
differential  effects.  Mayzner  (1966)  and  Mayzner  ana  Gabriel  (1964) 
found  that  spatial  organization  of  alphanumeric  data  is  important 
when  the  material  is  to  be  recalled.  Cropper  (1968)  demonstrated, 
among  other  things,  that  breaking  tabular  displays  into  blocks 
(e.g.,  3x3  blocks)  improved  visual  search  time.  Wright  (1968) 
investigated  several  different  tabular  formats  (e.g.,  matrix, 
"schematic"  table,  table  of  lists)  and  found  significant  performance 
differences. 

Haney  (1969)  compared  a  wel 1 -formatted  tabular  display  -- 
containing  test  instructions  for  electronic  equipment  --  with  the 
previously  used  verbal  description  of  the  procedure.  The  tabular 
display  resulted  in  fewer  errors.  This  report  provides  several 
ideas  for  the  tabular  presentation  of  procedural  information. 

Stewart's  (1976)  formatting  principles,  discussed  above, 
are  presented  primarily  in  the  context  of  tabular  displays,  and 
are  highly  applicable  here.  Engel  and  Granda  (1975)  also  present 
a  number  of  low-level  guidelines  for  tabular  displays,  and  especially 
list  displays.  However,  no  well  integrated  set  of  guidelines  was 
found. 


Formatting  of  Graphical  Displays 

A  very  similar  situation  exists  in  the  area  of  formatting 
graphs  and  charts.  Almost  all  of  the  research  literature  on  this 
topic  is  concerned  with  non-computer  displays,  and  was,  therefore, 
not  within  the  basic  scope  of  the  survey.  This  research  literature 
contains  many  studies  comparing  two  or  more  kinds  of  graphical 
displays  for  the  presentation  of  a  particular  class  of  information. 
Many  of  the  studies  were  done  more  than  20  years  ago,  and  they  do 
not  form  a  particularly  well  integrated  body  of  knowledge,  but  they 
are  relevant  to  the  development  of  design  guidelines  in  this  area. 
Studies  of  which  we  are  aware  include  Bowen  and  Gradijan  (1963), 
Cropper  (1968),  Croxton  &  Stein  (1932),  Price  et  al  (1974),  Schmid 
(1954),  Schutz  (1961a,  1961b),  Schwartz  and  Taylor  (1968),  Strick¬ 
land  (1938),  Vernon  (1946,  1950,  1952a,  1952b,  1953),  Wainer  (1974), 
and  Washburne  (1927). 

It  is  clear  that  many  specific  guidelines  can  be  derived 
from  these  studies.  For  example,  multiple  lines  on  a  single  graph 
are  better  than  multiple  graphs  if  the  task  requires  comparison  of 
the  trend  lines,  but  not  otherwise  (Schutz,  1961b).  It  is  also 
clear,  though,  that  this  approach  will  leave  many  gaps  in  the  result¬ 
ing  guidelines.  It  would  appear  that  the  best  approach  is  to  begin 
with  existing  graphic-arts  design  material,  which  is  clearly  prag¬ 
matic  in  origin,  and  rework  it  in  accordance  with  available  empiri¬ 
cal  data.  The  dangers  of  "folklore"  are  well  established,  and  care 
will  be  required  in  exercising  this  approach.  Some  existing  guide¬ 
lines  in  the  graphic  arts  area  which  might  provide  a  good  starting 
point  are  Lutz  (1949),  Schmid  (1954),  and  possibly  Huff  (1954), 

Bertin  (1967),  and  Dickinson  (1973). 

Modern  applications  of  interactive  graphics  tend  to  involve 
much  more  sophisticated  displays  than  simple  charts  and  graphs, 
however.  A  few  empirical  studies  have  been  done  on  the  graphical 
display  of  particular  kinds  of  information  (e.g.,  geographic. 
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hierarchic,  temporal  --  sec  next  section  on  special  display  types), 
but  no  well-integrated  picture  emerges  from  the  surveyed  literature. 
This  is  an  area  in  which  applications  are  being  developed  with 
relatively  little  human  factors  research  input,  although  human  fac¬ 
tors  personnel  seem  to  be  able  to  assist  significantly  when  they 
are  involved  in  such  projects,  on  the  basis  of  their  general  know¬ 
ledge  of  human  factors  principles  and  of  human  vision  and  informa¬ 
tion  processing.  Unfortunately,  it  is  not  yet  clear  how  to  capture 
this  knowledge  in  a  way  that  is  really  useable  to  the  designer,  other 
than  by  providing  guidelines  for  specific  display  types,  coding, 
etc. 

Special  Display  Types 

Some  useful  information  is  available  concerning  effective 
designs  for  a  few  special  display  types  (e.g.,  geographic  displays). 
Much  of  this  information  is  to  be  found  in  reports  of  the  design  of 
particular  systems,  which  were  not  included  in  the  survey.  A  number 
of  research  projects  have  also  investigated  specialized  display  types, 
and  those  reports  were  generally  within  the  scope  of  the  survey. 

It  seems  reasonable  to  suppose  that  guidelines  dealing  with  particu¬ 
lar  display  types  would  be  effective  in  a  design  guide,  especially 
if  the  design  guide  is  aimed  at  a  particular  class  of  system.  Effec¬ 
tive  development  of  such  guidelines  would  require  additional  atten¬ 
tion  to  the  literature  on  specific  systems,  however.  It  would  also 
require  considerable  reliance  on  general  human  factors  knowledge, 
since  available  empirical  data  cover  only  a  small  proportion  of  the 
design  issues.  Much  of  the  guidelines  development  effort  would 
involve  evaluating  ideas  used  in  existing  systems  or  advocated  in 
the  literature. 

Geographical  displays,  especially  for  tactical  systems,  have 
received  considerable  attention  (see  especially  Bowen  et  al,  1975; 
Connelly,  1977;  Hammell  et  al ,  1975;  Irving  et  al ,  1977;  Smith  et 
al ,  1972;  Vicino  &  Ringel,  1966).  Displays  for  hierarchic  search 
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dialogues  are  discussed  by  Thompson  (1969,  1971)  and  Uber  et  al 
(1968).  TOTE  displays  were  investigated  by  McKendry  et  al  (1969). 
Displays  of  mathematical  trends  were  investigated  by  Schutz 
(1961a,  b),  but  see  the  previous  discussion  of  graphical  display 
formatting  for  other  papers  dealing  with  graphs  and  charts.  Tem¬ 
poral  displays  which  present  information  on  a  series  of  discrete 
events  are  discussed  by  Chesler  and  Turn  (1967),  and  are  discussed 
in  the  specific  context  of  process  control  by  Goodstein  (1969) 
and  Griffin  (1973).  The  use  of  time  compression  in  displays  for 
monitoring  tasks  is  shown  to  be  effective  by  Howell  et  al  (1966) 
and  Scanlan  (1975),  who  present  useful  ideas  for  employment  of 
this  technique.  Three-dimensional  displays  are  discussed  by  Gutt- 
mann  and  Anderson  (1962)  and  by  Vlahos  (1965),  who  provides  a 
reasonably  good  discussion  of  the  visual  cues  and  "anti-cues" 
involved  in  such  displays. 
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INPUT  DEVICES  AND  TECHNIQUES 


INPUT  DEVICE  TYPES 

There  exist  a  large  number  of  input  device  types  which  might 
be  considered  by  the  designer  of  an  interactive  system  (see  Figure  21). 
Although  there  is  a  very  large  body  of  research  reports  in  this  area, 
the  vast  majority  of  the  research  has  been  concerned  with  keyboards, 
and  most  of  the  other  input  devices  have  received  relatively  scant 
attention.  The  literature  on  keyboards  is  quite  rich,  and  provides  a 
reasonable  basis  for  guidelines  on  almost  all  physical  aspects  of  key¬ 
boards,  and  many  logical  and  procedural  aspects  as  well.  The  research 
dealing  with  other  input  devices,  at  least  in  the  context  of  computer 
systems,  consists  mostly  of  studies  comparing  the  speed  and  accuracy 
of  two  or  more  specific  input  devices  in  the  context  of  a  particular 
task.  While  such  studies  may  help  in  device  selection  decisions, 
they  provide  little  data  to  support  the  design  of  the  input  device 
i tself . 


It  is  useful  to  consider  alternative  input  devices  in  terms 
of  five  main  types  of  input  tasks: 

(1)  Text  input 

(2)  Input  of  numerical  quantities 

(3)  Selection  of  command,  operand,  etc.  from  display 

(4)  Discrete  positional  ("graphical")  input 

(5)  Continuous  positional  ("graphical")  input 

This  is  not  necessarily  a  complete  list;  for  example,  it  is  not 
entirely  clear  where  the  selection  of  binary  values  or  the  input 
of  alphanumeric  codes  belongs.  It  appears, though,  to  be  an  appro¬ 
priate  list  for  briefly  discussing  the  properties  of  the  major 
interactive  input  devices. 

The  virtual  input  devices  (pick,  button,  locator,  valuator) 
advocated  by  Foley  and  Wallace  (1974)  and  Wallace  (1976)  provide  a 
somewhat  similar  view  of  input  task  types,  and  suggest  that  the 
variety  of  input  tasks  could  usefully  be  reflected  in  the  modular 
structure  of  interactive  dialogue  control  software. 
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Figure  21.  Input  Device  Types  (Page  1  of  3) 
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Figure  21.  Input  Device  Types  (Concluded) 


In  most  cases,  the  basic  input  task  dictates  that  one  parti¬ 
cular  class  of  input  device  will  be  used.  For  example,  a  require¬ 
ment  for  the  input  of  text  (case  1),  in  significant  quantities, 
usually  implies  an  alphabetic  keyboard.  Input  rates  available  by 
selecting  letters  from  a  displayed  list,  using  a  lightpen,  trackball, 
or  whatever,  would  likely  be  unacceptable  unless  the  typing  component 
of  the  user's  task  is  very  small.  In  fairness,  though,  there  is  little 
empirical  data  on  this  (one  study,  by  Morrill  et  al,  1968,  was  not 
methodologically  adequate  to  resolve  this  issue).  For  occasional 
typing,  the  use  of  a  touchpanel  with  a  displayed  keyboard  might  work, 
but  there  are  serious  problems  of  proprioceptive  feedback  here,  and 
no  known  research  to  support  such  a  design  decision.  Under  some 
circumstances,  constrained  hand  printing  for  input  by  optical  character 
recognition  may  be  acceptable,  but  input  rates  using  this  method  are 
often  lower  than  those  achievable  by  even  unskilled  typing  (Devoe,  1967; 
Masterson  &  Hirsch,  1962). 

Similarly,  the  requirement  for  positional  input  --  whether  dis¬ 
crete  (case  4)  or  continuous  (case  5)  --  usually  implies  the  use  of 
an  explicitly  positional  (or  "graphical")  input  device.  There  are 
several  such  devices,  including  the  lightpen,  joystick,  trackball, 
mouse,  graphical  input  tablet,  and  touch  panel.  Selection  among  these 
devices  depends  on  a  more  detailed  analysis  of  the  user's  tasks. 

When  the  user's  task  is  the  selection  of  individual  words  or 
characters  from  a  displayed  list  or  text  (case  3),  either  keyboard 
or  graphical  input  devices  may  be  appropriate,  and  most  of  the  com¬ 
parative  experiments  have  been  concerned  with  this  situation.  Earl 
and  Goff  (1965)  compared  "type-in"  and  (simulated)  "point-in"  input 
methods  for  selecting  a  word  from  a  displayed  list,  as  in  command 
or  operand  selection.  Although  experienced  typists  performed  better 
with  type-in  input,  everyone  else  exhibited  greater  accuracy,  and 
perhaps  greater  speed,  when  point-in  input  was  used.  This  perfor¬ 
mance  advantage  probably  exists  only  when  the  alternative  list  is 
relatively  short  and  contains  readily  discriminable  items.  However, 
a  properly  designed  command  language  should  have  those  properties 
anyway  (see  previous  section  on  command  languages).  Ramsey  (unpublished) 
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has  also  found  a  well-designed  lightpen  input  to  be  both  faster 
and  more  accurate  than  function  keys  for  very  simple  (verb-noun- 
adjective)  command  construction. 

For  selecting  particular  words  or  characters  from  a  text 
display,  as  in  text  editing,  a  lightpen  appears  to  be  much  faster 
than  keyboard  cursor  controls  (Goodwin,  1975),  and  a  mouse  appears 
to  be  both  faster  and  more  accurate  than  both  ordinary  cursor  con¬ 
trol  keys  and  "text"  keys  which  advance  to  the  next  character, 
word,  line,  or  paragraph  (Card  et  al ,  1977).  Among  the  graphical 
input  devices  investigated  for  this  situation,  the  mouse  and,  next, 
the  lightpen,  appear  preferable  to  a  rate-controlled  joystick, 
graphical  input  tablet,  and  knee  control  (Card  et  al ,  1977;  English 
et  al ,  1967).  The  trackball  may  be  a  very  viable  candidate  here, 
but  we  are  unaware  of  any  comparative  study  involving  its  use  for 
text  selection.  A  touch  panel  might  also  be  useable  for  selection 
of  whole  words,  although  individual  character  selection  is  probably 
beyond  its  resolution  limits.  No  useable  empirical  data  were  found 
pertaining  to  the  use  of  touch  panels  for  any  task. 

The  survey  found  few  empirical  studies  of  continuous  posi¬ 
tional  input,  such  as  occurs  in  freehand  drawing.  Herot  (1976)  and 
Negroponte  (1975)  provide  a  very  interesting  discussion  of  software 
used  for  this  task  in  conjunction  with  a  graphical  input  tablet, 
but  provide  no  empirical  data.  Irving  et  al  (1976)  found  a  trackball 
superior  to  both  lightpen  and  joystick  for  drawing  straight  lines 
and  circles,  but  did  not  investigate  more  sophisticated  hand  drawing. 

It  is  important  to  consider  all  the  input  tasks  of  the  user 
before  selecting  any  input  device  or  devices.  In  part,  this  is  a 
simple  matter  of  avoiding  suboptimization  and  finding  a  cost-effective 
solution.  If,  for  example,  the  user  has  considerable  typing  and  a 
moderate  amount  of  text  editing  to  do,  keyboard  cursor  controls  or 
thumbwheel  cursor  controls  mounted  on  the  keyboard  might  be  accept¬ 
able.  Obviously,  other  factors  enter  into  such  a  decision,  such  as 
user  hourly  cost,  user  discretion,  system  operating  cost,  basic 
terminal  cost,  etc.  Consider,  though,  the  similar  case  in  which  the 
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user  must  do  considerable  typing,  a  moderate  amount  of  text  editing, 
and  some  freehand  drawing.  If,  in  this  case,  a  trackball  or  lightpen 
is  used  for  the  drawing,  it  should  probably  also  be  used  for  cursor 
control  or  text  selection  in  the  text  editing  task.  It  would  be  nice 
to  be  able  to  assert  that  a  great  deal  of  human  performance  data  exist 
to  support  such  tradeoff  decisions,  but  in  fact,  there  appear  to  be 
very  limited  data  for  input  devices  other  than  keyboards. 

Another  reason  for  considering  all  the  user's  input  tasks  is 
the  performance  effect  of  mode  mixing.  When  the  user  must  alternate 
modes  of  input,  as  by  alternately  typing  and  using  a  lightpen,  the 
performance  effects  can  be  highly  detrimental.  For  example,  Earl  and 
Goff  (1965)  found  both  point-in  and  unskilled  type-in  input  to  be 
more  accurate  and  faster  than  any  of  several  conditions  in  which  some 
point-in  and  some  type-in  input  were  combined.  It  is  important  that 
mode  mixing  problems  be  considered  in  selecting  input  devices.  It 
is  common,  for  example,  for  a  trackball,  mouse,  or  joystick  to  be 
selected  in  preference  to  a  lightpen  because  the  user  must  also  often 
use  the  keyboard  for  typing.  Card  et  al  (1977)  provide  a  useful  analy¬ 
sis  of  the  temporal  properties  associated  with  a  simp^  case  of  mode 
mixing,  and  they  provide  a  simple  quantitative  formula  (based  on  Fitts' 
Law)  which  accounts  for  observed  user  performance  differences  among 
several  text  selection  devices  on  the  basis  of  the  known  dynamics  of 
human  hand  movement. 

If  mode  mixing  affects  user  performance,  then  it  is  also  impor¬ 
tant  to  know  the  assumptions  underlying  comparative  performance  studies. 
For  example,  both  Card  et  al  (1977)  and  English  et  al  (1967)  began  each 
trial  with  depression  of  the  space  bar  by  the  user,  using  the  same 
hand  that  would  then  be  used  to  control  the  text-selection  device. 

This  experimental  paradigm  clearly  simulates  a  case  in  which  typing 
is  alternated  with  text  selection.  It  is  easy  to  imagine  a  dialogue 
in  which  such  mode  mixing  is  unnecessary,  as  by  using  a  lightpen  for 
both  command  and  operand  selection.  It  is  quite  likely  that  performance 
with  the  lightpen  would  at  least  equal  that  using  a  mouse,  when  this 
paradigm  is  used.  Other  factors  are  relevant  here,  though,  especially 
the  fatigue  associated  with  long-term  use  of  the  lightpen. 
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The  minimization  of  such  problems  requires  the  designer  to 
take  a  systems  view  of  the  user's  input  requirements,  and  to  deter¬ 
mine  the  overall  set  of  input  devices  based  on  that  view.  This  can 
lead  to  a  very  different  solution  than  might  result  from  isolated 
consideration  of  each  input  task  in  turn.  For  an  advanced  word 
processing  application,  Engelbart  (1973)  has  even  gone  so  far  as 
to  advocate  the  operation  of  a  mouse  with  one  hand,  for  text  selec¬ 
tion,  while  the  other  hand  operates  a  5-key  chorded  alphabetic  keyboard 
for  both  command  selection  and  text  input.  (A  standard  alphanumeric 
keyboard  is  also  provided.)  This  obviously  requires  a  highly  trained 
user,  and  may  not  often  be  cost-effective,  but  it  illustrates  a  solu¬ 
tion  developed  explicitly  to  minimize  mode  mixing. 

Based  on  such  considerations  as  have  been  discussed  in  this 
section,  it  is  clear  that  procedural  guidelines  can  be  developed  for 
use  by  the  designer  in  selecting  a  set  of  input  devices  for  a  parti¬ 
cular  system.  It  is  also  clear,  though,  that  such  guidelines  may  be 
somewhat  speculative,  since  inadequate  empirical  performance  data 
exist  with  respect  to  many  of  the  input  devices  and  input  tasks.  Still, 
a  human  factors  analysis  of  those  devices  and  tasks  would  provide  a 
firmer  basis  for  design  decisions  than  is  now  available  to  most  inter¬ 
active  system  designers. 

Once  the  designer  has  chosen  a  particular  type  or  types  of  input 
device,  it  is  desirable  to  support  further  design  decisions  dealing 
with  the  selection  or  design  of  the  specific  keyboard,  lightpen,  etc., 
which  will  be  used  in  the  system.  Except  for  keyboards,  few  empirical 
data  were  found  which  might  support  these  detailed  design  decisions. 

This  does  not  necessarily  imply  that  no  such  data  exist.  In  many  cases, 
studies  of  these  devices  have  been  conducted  outside  the  context  of 
interactive  computer  systems,  or  unpublished  studies  have  been  done 
by  manufacturers.  Nonetheless,  the  overall  body  of  knowledge  in  the 
area  of  detailed  design  parameters  of  nonkeyboard  interactive  input 
devices  is  disappointing. 


137 


In  the  case  of  keyboards,  a  very  extensive  literature  exists, 
and  only  survey  studies  and  a  few  specialized  papers  were  included 
in  our  survey.  A  good  descriptive  survey  of  the  literature  is  Alden 
et  al  (1972),  although  this  report  contains  little  in  the  way  of 
derived  conclusions  or  guidelines.  Klemmer  (1971)  also  provides  a 
good,  though  brief,  review  of  this  literature.  The  review  in  Stewart 
et  al  (1974)  is  more  critically  penetrating  and  is  directed  primarily 
at  interactive  computer  systems,  but  also  provides  little  in  the  way 
of  guidelines.  The  best  prescriptive  review  of  data  entry  devices 
and  techniques  in  general,  and  keyboards  in  particular,  is  Seibel 
(1972).  The  guidelines  presented  by  Seibel  are  crisp,  sound,  and 
are  usually  well  founded  in  empirical  research. 

Such  guidelines  should  be  augmented  by  a  somewhat  more  pro¬ 
cedural  discussion  of  keyboard  design  issues  frequently  encountered 
in  the  design  of  interactive  systems.  Such  issues  include  deter¬ 
mination,  placement  and  grouping  of  special  function  keys,  color 
coding  of  keys,  and  a  variety  of  computer-specific  functions  (e.g., 
"break")  which  do  not  appear  on  typewriter  keyboards  (see  also 
Dolotta,  1970). 

TERMINALS,  WORK  STATIONS,  AND  FACILITY  LAYOUT 

Although  displays  and  input  devices  have  been  discussed 
separately,  there  are  some  remaining  design  decisions  which  concern 
the  packaging  of  such  devices  into  terminals,  consoles,  and  whole 
facilities.  The  issues  involved  are  primarily  those  of  physical  lay¬ 
out,  and  are  precisely  the  areas  in  which  existing  non-computer- related 
human  engineering  methods  and  guidelines  are  most  directly  applicable. 
Design  guidelines  in  these  areas  can  be  highly  quantitative,  and 
should  probably  be  much  like  Van  Cott  and  Kink.ade’s  (1972)  guide¬ 
lines  for  single-user  workstation  design  or  Thomson's  (1972)  guide¬ 
lines  for  multi-user  workstations.  Standard  human  engineering  texts 
are  also  applicable,  as  are  portions  of  existing  military  standards 
(e.g.,  MI L- STD- 1 472B ) .  Dolotta  (1970)  suggests  a  reasonably  good 
set  of  standards  for  a  teletypewriter  terminal,  but  there  are  actually 
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very  few  guidelines  dealing  explicitly  with  computer-related 
workstation  design. 

Most  of  the  literature  which  deals  with  computer- related 
console  and  facility  layout  consists  of  case  studies.  This  informa¬ 
tion  can  provide  a  useful  format  for  guidelines  which  will  be  used 
by  designers  unsophisticated  in  human  factors.  Very  few  new  princi¬ 
ples  are  to  be  found  in  these  case  studies,  however,  even  though 
most  are  quite  good.  It  would  appear  that  existing  guidelines  on 
facility  and  console  layout  are  about  as  applicable  to  computer 
systems  as  in  other  types  of  person-machine  systems.  Among  the 
case  studies  are  LeCocq's  (1977)  description  of  a  CRT  terminal  design 
project,  Shackel's  (1962)  design  of  a  large  console,  Van  Arsdel's 
(1974)  purely  physical  discussion  of  a  large  NORAD  information  system. 
Wood's  (1976)  example  of  a  console  layout,  and  G*litz  and  Laska's 
(1969,  1970)  layout  of  a  computer  facility.  Although  Yllo's  (1962) 
physiological  analysis  dealt  with  a  keypunch  workstation,  the 
methodology  used  is  still  applicable.  There  is,  in  fact,  emerging 
evidence  that  the  design  of  modern  data  entry  workstations  contri¬ 
butes  to  both  postural  fatigue  (Duncan  &  Ferguson,  1974)  and  visual 
fatigue  (Oestberg,  1975,  1976). 

The  survey  uncovered  almost  nothing  in  the  way  of  empirical 
research  on  overall  workstation  design  for  specifically  computer- 
related  tasks  and  settings.  We  are  aware  of  some  proprietary  re¬ 
search  on  this  topic  in  the  United  States,  but  it  is  unlikely  that 
the  results  will  soon  become  public.  There  is  also  work  going  on 
in  Europe  which  concerns  the  design  of  terminal  workstations  for 
secretarial  and  clerical  personnel.  One  such  study  involves  the 
determination  of  appropriate  physical  dimensions  of  the  terminal, 
table  surface,  etc.,  variations  in  the  inclination  and  distance 
of  the  CRT  display  from  the  operator,  and  the  use  of  a  special 
carrier  for  source  documents  (George  Sowden,  personal  communication). 
Guidelines  for  such  a  workstation  might  be  helpful.  In  the  United 
States,  at  least,  it  is  very  common  to  see  commercial  terminals 
placed  on  wholely  inadequate  tables,  with  little  thought  given  to 
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work  surface,  background  illumination,  or  other  environmental 
factors.  If  workstation  guidelines  were  integrated  into  a  general 
design  guide  for  interactive  systems,  this  problem  might  receive 
at  least  some  attention. 
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CONCLUSIONS 


In  this  survey,  we  have  considered  a  very  broad  range  of 
literature.  Although  resource  limitations  necessarily  prevented 
the  inclusion  of  all  of  the  papers  potentially  relevant  to  the 
field  of  human  factors  in  computer  systems,  we  feel  that  a  reason¬ 
ably  clear  and  accurate  picture  of  the  overall  state  of  the  art 
emerged  from  the  study.  At  the  same  time,  some  conclusions  emerged 
with  respect  to  the  feasibility,  potential  utility,  and  possible 
form  and  content  of  a  human  factors  guide  to  interactive  computer 
system  design. 

1.  The  existing  literature  relevant  to  this  field 
is  badly  fragmented  because  of  its  foundation 
in  several  different  disciplines,  and  because 
relevant  empirical  data  include  those  derived 
from  many  studies  not  specifically  dealing  with 
computer  systems.  Much  of  this  literature  is 
outside  of  that  normally  considered  by  human 
factors  personnel,  and  the  vast  majority  is 
outside  the  range  used  by  interactive  system 
designers.  There  is  a  strong  need  for  the 
development  of  integrated  guidelines. 

2.  While  there  is  a  large  body  of  empirical  data 
relevant  to  such  guidelines,  there  are  many 
significant  gaps.  In  particular,  there  is 
inadequate  information  to  support  the  develop¬ 
ment  of  highly  quantitative  "reference  handbook"  - 
type  guidelines,  except  within  certain  fairly 
limited  subdomains. 

3.  Consideration  of  the  problem-solving  behavior 
and  information  needs  of  the  interactive  system 
designer  leads  us  to  believe  that  "reference 
handbook"  guidelines  would  not  truly  satisfy 


the  need  anyway.  What  is  needed  is  a  design 
guide  which  is  largely  procedural  in  nature 
and  is  organized  around  the  design  process 
employed  by  designers. 

4.  Despite  the  existing  gaps  in  our  knowledge, 

a  design  guide  of  this  sort  appears  feasible. 
Such  an  approach  is  compatible  with  the  pre¬ 
sentation  of  human  factors  methods,  as  well  as 
empirical  data  and  specific  recommendations. 

In  such  a  presentation,  general  psychological 
knowledge  can  often  be  used  to  advantage, 
especially  in  areas  in  which  empirical  in¬ 
formation  is  sparse.  In  areas  in  which  specific 
recommendations  are  impossible,  this  approach 
can  at  least  direct  the  designer's  attention 
to  relevant  factors. 

5.  Guidelines  associated  with  the  early  system 
design  process  (e.g.,  user  requirements  analy¬ 
sis)  will  necessar^y  emphasize  methods  to  be 
employed  by  the  designer.  Later,  when  the 
design  decisions  are  more  concrete  and  de- 

% 

tailed  --  and  concern  areas  in  which  more 
empirical  research  has  been  done  —  the  guide¬ 
lines  can  be  more  specific  and  prescriptive. 

6.  Although  it  is  feasible  to  construct  a  de¬ 
sign  guide  for  interactive  systems  in  general, 
it  may  be  better  to  develop  them  for  restricted 
types  of  systems  (e.g.,  message  processing,  or 
tactical  information  systems).  User  behavior, 
and  thus,  desirable  system  properties,  tend 

to  be  highly  task-specific.  By  concentrating 
on  a  restricted  range  of  user  tasks,  it  should 
be  possible  to  make  guidelines  more  prescriptive 
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and  explicit,  and  to  use  more  meaningful 
examples,  as  well . 


It  is  hoped  that  this  survey  has  provided  the  reader  with  a 
greater  understanding  of  the  state  of  the  art  in  human  factors  in 
computer  systems,  and  that  it.  has  laid  a  sound  foundation  for  the 
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